Chapter One


The algebra and calculus 
of vectors 


KEY FEATURES 

The key features of this chapter are the rules of vector algebra and differentiation of vector 
functions of a scalar variable. 


This chapter begins with a review of the rules and applications of vector algebra. 
Almost every student taking a mechanics course will already have attended a course on 
vector algebra, and so, instead of covering the subject in full detail, we present, for easy 
reference, a summary of vector operations and their important properties, together with a 
selection of worked examples. 

The chapter closes with an account of the differentiation of vector functions of a 
scalar variable. Unlike the vector algebra sections, this is treated in full detail. Applications
include the tangent vector and normal vector to a curve. These will be needed in
the next chapter in order to interpret the velocity and acceleration vectors. 

1.1 VECTORS AND VECTOR QUANTITIES 

Most physical quantities can be classified as being scalar quantities or vector 
quantities. The temperature in a room is an example of a scalar quantity. It is so called 
because its value is a scalar, which, in the present context, means a real number. Other 
examples of scalar quantities are the volume of a can, the density of iron, and the pressure 
of air in a tyre. Vector quantities are defined as follows: 

Definition 1.1 Vector quantity If a quantity Q has a magnitude and a direction associated
with it, then Q is said to be a vector quantity. [Here, magnitude means a positive
real number and direction is specified relative to some underlying reference frame* that
we regard as fixed.]

The displacement of a particle† is an example of a vector quantity. Suppose the
particle starts from the point A and, after moving in a general manner, ends up at the 


* See section 2.2 for an explanation of the term ‘reference frame’. 

† A particle is an idealised body that occupies only a single point of space.

FIGURE 1.1 Four different representations of each of the
vectors a, b, c form the twelve edges of the parallelepiped
box.


point B. The magnitude of the displacement is the distance AB and the direction of the 
displacement is the direction of the straight line joining A to B (in that order). Another 
example is the force applied to a body by a rope. In this case, the magnitude is the strength 
of the force (a real positive quantity) and the direction is the direction of the rope (away 
from the body). Other examples of vector quantities are the velocity of a body and the 
value of the electric (or magnetic) field. In order to manipulate all such quantities without 
regard to their physical origin, we introduce the concept of a vector as an abstract quantity. 

Definition 1.2 Vector A vector is an abstract quantity characterised by the two properties
magnitude and direction. Thus two vectors are equal if they have the same magnitude
and the same direction.*

Notation. Vectors are written in bold type, for example a, b, r or F. The magnitude of
the vector a, which is a real positive number, is written |a|, or sometimes† simply a.

It is convenient to define operations involving abstract vectors by reference to some 
simple, easily visualised vector quantity. The standard choice is the set of directed line 
segments. Each straight line joining two points (P and Q say, in that order) is a vector
quantity, where the magnitude is the distance PQ and the direction is the direction of
Q relative to P. We call this the line segment PQ and we say that it represents some
abstract vector a.‡ Note that each vector a is represented by infinitely many different line
segments, as indicated in Figure 1.1.


* In order that our set of vectors should have a standard algebra, we also include a special vector whose 
magnitude is zero and whose direction is not defined. This is called the zero vector and written 0. The 
zero vector is not the same thing as the number zero! 

† It is often useful to denote the magnitudes of the vectors a, b, c, ... by a, b, c, ..., but this does risk
confusion. Take care!

‡ The zero vector is represented by line segments whose end point and starting point are coincident.

FIGURE 1.2 Addition, subtraction and scalar multiplication of vectors.


1.2 LINEAR OPERATIONS: a + b AND λa

Since vectors are abstract quantities, we can define sums and products of vectors 
in any way we like. However, in order to be of any use, the definitions must create some 
coherent algebra and represent something of interest when applied to a range of vector 
quantities. Also, our definitions must be independent of the particular representations 
used to construct them. The definitions that follow satisfy all these requirements. 


The vector sum a + b 

Definition 1.3 Sum of vectors Let a and b be any two vectors. Take any representation
PQ of a and suppose the line segment QR represents b. Then the sum a + b of a
and b is the vector represented by the line segment PR, as shown in Figure 1.2 (left).


Laws of algebra for the vector sum 


(i) b + a = a + b (commutative law)

(ii) a + (b + c) = (a + b) + c (associative law)


Definition 1.4 Negative of a vector Let b be any vector. Then the vector with the same
magnitude as b and the opposite direction is called the negative of b and is written −b.
Subtraction by b is then defined by

a − b = a + (−b).

[That is, to subtract b just add −b, as shown in Figure 1.2 (centre).]

The scalar multiple λa

Definition 1.5 Scalar multiple Let a be a vector and λ be a scalar (a real number).
Then the scalar multiple λa is the vector whose magnitude is |λ||a| and whose direction
is

(i) the same as a if λ is positive,

(ii) undefined if λ is zero (the answer is the zero vector),

(iii) the same as −a if λ is negative.

It follows that −(λa) = (−λ)a.


Laws of algebra for the scalar multiple 

(i) λ(μa) = (λμ)a (associative law)

(ii) λ(a + b) = λa + λb and (λ + μ)a = λa + μa (distributive laws)


The effect of the above laws is that linear combinations of vectors can be manipulated
just as if the vectors were symbols representing real or complex numbers.

Example 1.1 Laws for vector sum and scalar multiple 

Simplify the expression 3(2a − 4b) − 2(2a − b).

Solution 

On this one occasion we will do the simplification by strict application of the laws. It 
is instructive to decide which laws are being used at each step! 

3(2a − 4b) − 2(2a − b) = 3(2a + (−4)b) + (−2)(2a + (−1)b)

= (6a + (−12)b) + ((−4)a + 2b)

= (6a + (−4)a) + ((−12)b + 2b)

= 2a + (−10)b = 2a − 10b. ■
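
As a quick numerical check on this simplification, both sides can be evaluated for arbitrary sample vectors. The sketch below is only an illustration: the representation of vectors as 3-tuples, the helper names `add` and `scale`, and the sample components are assumptions made here, not part of the text.

```python
# Vectors are represented as 3-tuples of components (an illustrative choice).
def add(u, v):
    """Component-wise vector sum u + v."""
    return tuple(x + y for x, y in zip(u, v))

def scale(lam, u):
    """Scalar multiple lam * u."""
    return tuple(lam * x for x in u)

a = (1, -2, 3)   # arbitrary sample vectors
b = (4, 0, -1)

# Left side: 3(2a - 4b) - 2(2a - b), built only from sums and scalar multiples.
lhs = add(scale(3, add(scale(2, a), scale(-4, b))),
          scale(-2, add(scale(2, a), scale(-1, b))))

# Right side: 2a - 10b.
rhs = add(scale(2, a), scale(-10, b))

print(lhs, rhs, lhs == rhs)
```

Agreement for one pair of vectors is not a proof, but it is a useful guard against sign slips.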


Unit vectors 

A vector of unit magnitude is called a unit vector. If any vector a is divided by its own
magnitude, the result is a unit vector having the same direction as a. This new vector is
denoted by â so that

â = a/|a|.


Basis sets 

Suppose a and b are two non-zero vectors, with the direction of b neither the same nor
opposite to that of a. Let OA, OB be representations of a, b and let V be the plane

FIGURE 1.3 The set {a, b} is a basis for all vectors lying in the plane OAB.

containing the triangle OAB. Then (see Figure 1.3) any vector v whose representation
OV lies in the plane V can be written in the form

v = λa + μb, (1.1)

where the coefficients λ, μ are unique. Vectors that have their directions parallel to the
same plane are said to be coplanar. Thus we have shown that any vector coplanar with a
and b can be expanded uniquely in the form (1.1). It is also apparent that this expansion
set cannot be reduced in number (in this case to a single vector). For these reasons the pair
of vectors {a, b} is said to be a basis set for vectors lying* in the plane V.

Suppose now that {a, b, c} is a set of three non-coplanar vectors. Then any vector v,
without restriction, can be written in the form

v = λa + μb + νc, (1.2)

where the coefficients λ, μ, ν are unique. In this case we say that the set {a, b, c} is a basis
set for all three-dimensional vectors. Although any set of three non-coplanar vectors forms
a basis, it is most convenient to take the basis vectors to be orthogonal unit vectors. In this
case the basis set† is usually denoted by {i, j, k} and is said to be an orthonormal basis.
The representation of a general vector v in the form

v = λi + μj + νk

is common in problem solving.

In applications involving the cross product of vectors, the distinction between right-
and left-handed basis sets actually matters. There is no experiment in classical mechanics
or electromagnetism that can distinguish between right- and left-handed sets. The difference
can only be exhibited by a model or some familiar object that exhibits ‘handedness’,
such as a corkscrew.‡ Figure 1.4 shows a right-handed orthonormal basis set attached
to a well known object.


* Strictly speaking vectors are abstract quantities that do not lie anywhere. This phrase should be taken to
mean ‘vectors whose directions are parallel to the plane V’.

† It should be remembered that there are infinitely many basis sets made up of orthogonal unit vectors.

‡ Suppose that the non-coplanar vectors {a, b, c} have representations OA, OB, OC respectively. Place
an ordinary corkscrew with the screw lying along the line through O perpendicular to the plane OAB,

FIGURE 1.4 A standard basis set {i, j, k} is
both orthonormal and right-handed.

FIGURE 1.5 The points A, B, C have position
vectors a, b, c relative to the origin O.

Definition 1.6 Standard basis set If an orthonormal basis {i, j, k} is also right-handed
(as shown in Figure 1.4), we will call it a standard basis.


Position vectors and vector geometry 

Suppose that O is a fixed point of space. Then relative to the origin O (and relative to the
underlying reference frame), any point of space, such as A, has an associated line segment,
OA, which represents some vector a. Conversely, the vector a is sufficient to specify the
position of the point A.

Definition 1.7 Position vector The vector a is called the position vector of the point
A relative to the origin O. [It is standard practice, and very convenient, to denote the
position vectors of the points A, B, C, ... by a, b, c, and so on, as shown in Figure 1.5.]

Since vectors can be used to specify the positions of points in space, we can now use
the laws of vector algebra to prove* results in Euclidean geometry. This is not just an academic
exercise. Familiarity with geometrical concepts is an important part of mechanics.
We begin with the following useful result:


and the handle parallel to OA. Now turn the corkscrew until the handle is parallel to OB and note the
direction in which the corkscrew would move if it were ‘in action’. (The direction of the turn must be
such that the angle turned through is at most 180°.) If OC makes an acute angle with this direction, the
set {a, b, c} (in that order) is right-handed; if OC makes an obtuse angle with this direction then the set
is left-handed.

* Some properties of Euclidean geometry have been used to prove the laws of vector algebra. However, 
this does not prevent us from giving valid proofs of other results. 

FIGURE 1.6 The point X divides the line AB
in the ratio λ : μ.



Example 1.2 Point dividing a line in a given ratio 


The points A and B have position vectors a and b relative to an origin O. Find the
position vector x of the point X that divides the line AB in the ratio λ : μ (that is
AX/XB = λ/μ).


Solution 

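It follows from Figure 1.6 that x is given by*

\[
\mathbf{x} = \mathbf{a} + \overrightarrow{AX}
= \mathbf{a} + \left(\frac{\lambda}{\lambda + \mu}\right)\overrightarrow{AB}
= \mathbf{a} + \left(\frac{\lambda}{\lambda + \mu}\right)(\mathbf{b} - \mathbf{a})
= \frac{\mu\,\mathbf{a} + \lambda\,\mathbf{b}}{\lambda + \mu}.
\]

In particular, the mid-point of the line AB has position vector ½(a + b). ■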
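
The section formula is easy to try out numerically. The following is a minimal sketch, assuming vectors are stored as tuples of components; the function name `divide` and the sample points are illustrative choices, and exact rational arithmetic is used so that the results can be compared without rounding.

```python
from fractions import Fraction

def divide(a, b, lam, mu):
    """Position vector of the point X on AB with AX/XB = lam/mu,
    computed as (mu*a + lam*b)/(lam + mu)."""
    total = Fraction(lam + mu)
    return tuple((mu * p + lam * q) / total for p, q in zip(a, b))

a = (1, 2, 3)     # sample position vectors of A and B
b = (7, -4, 9)

x = divide(a, b, 2, 1)      # X divides AB in the ratio 2:1
mid = divide(a, b, 1, 1)    # mid-point of AB

# AX and XB should be parallel with AX = 2 XB.
ax = tuple(p - q for p, q in zip(x, a))
xb = tuple(p - q for p, q in zip(b, x))
print(x, mid, ax, xb)
```

With these sample points, AX comes out as exactly twice XB, as the ratio 2 : 1 requires.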


Example 1.3 Centroid of a triangle 


Show that the three medians of any triangle meet in a point (the centroid) which 
divides each of them in the ratio 2:1. 

Solution 

Let the triangle be ABC where the points A, B, C have position vectors a, b, c
relative to some origin O. Then the mid-point P of the side BC has position vector
p = ½(b + c). The point X that divides the median AP in the ratio 2:1 therefore has
position vector

\[
\mathbf{x} = \frac{\mathbf{a} + 2\mathbf{p}}{2 + 1} = \frac{\mathbf{a} + \mathbf{b} + \mathbf{c}}{3}.
\]

The position vectors of the corresponding points on the other two medians can be
found by cyclic permutation of the vectors a, b, c and clearly give the same value.
Hence all three points are coincident and so the three medians meet there. ■
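
The argument can be illustrated for a particular triangle. In the sketch below, the helper `divide` (the section formula of Example 1.2) and the sample vertices are assumptions made for illustration; the three points dividing the medians in the ratio 2:1 are computed separately and compared with (a + b + c)/3.

```python
from fractions import Fraction

def divide(p, q, lam, mu):
    """Point dividing the segment from p to q in the ratio lam:mu."""
    total = Fraction(lam + mu)
    return tuple((mu * x + lam * y) / total for x, y in zip(p, q))

# Sample vertices of a triangle ABC (arbitrary choice).
a, b, c = (0, 0, 0), (6, 0, 3), (3, 9, -3)

# On each median (vertex to mid-point of the opposite side),
# take the point dividing it in the ratio 2:1.
g_a = divide(a, divide(b, c, 1, 1), 2, 1)
g_b = divide(b, divide(c, a, 1, 1), 2, 1)
g_c = divide(c, divide(a, b, 1, 1), 2, 1)

centroid = tuple(Fraction(x + y + z, 3) for x, y, z in zip(a, b, c))
print(g_a, g_b, g_c, centroid)
```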


* Strictly speaking we should not write expressions like a + AX since the sum we defined was the sum
of two vectors, not a vector and a line segment. What we really mean is ‘the sum of a and the vector
represented by the line segment AX’. Pure mathematicians would not approve but this notation is so
convenient we will use it anyway. It’s all part of living dangerously!

FIGURE 1.7 The bisector theorem:
AP/PB = OA/OB.

Example 1.4 The bisector theorem 

In a triangle OAB, the bisector of the angle AOB meets the line AB at the point P.
Show that AP/PB = OA/OB.

Solution 

Let the vertex O be the origin of vectors* and let the position vectors of the vertices
A, B relative to O be a, b as shown in Figure 1.7. The point with position vector
a + b does not lie on the bisector OP in general since the vectors a and b have different
magnitudes a and b. However, by symmetry, the point with position vector â + b̂ does
lie on the bisector and a general point X on the bisector has a position vector x of the
form

\[
\mathbf{x} = \lambda\,(\hat{\mathbf{a}} + \hat{\mathbf{b}})
= \lambda\left(\frac{\mathbf{a}}{a} + \frac{\mathbf{b}}{b}\right)
= \frac{b\,\mathbf{a} + a\,\mathbf{b}}{K},
\]

where K = ab/λ is a new constant. Now X will lie on the line AB if its position
vector has the form (μa + λb)/(λ + μ), that is, if K = a + b. Hence the position
vector p of P is

\[
\mathbf{p} = \frac{b\,\mathbf{a} + a\,\mathbf{b}}{a + b}.
\]

Moreover we see that P divides the line AB in the ratio a : b, that is, AP/PB =
OA/OB as required. ■
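
A numerical illustration of the bisector theorem may be helpful. The sketch below takes O as origin, forms the point with position vector (b a + a b)/(a + b) for a sample triangle, and checks both the ratio AP/PB and the fact that OP makes equal angles with OA and OB. The helper names and the sample vectors are illustrative assumptions.

```python
import math

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def norm(u):
    return math.sqrt(dot(u, u))

def sub(u, v):
    return tuple(x - y for x, y in zip(u, v))

a = (4.0, 0.0, 0.0)   # position vector of A, magnitude 4
b = (0.0, 3.0, 4.0)   # position vector of B, magnitude 5
la, lb = norm(a), norm(b)

# p = (|b| a + |a| b) / (|a| + |b|)
p = tuple((lb * x + la * y) / (la + lb) for x, y in zip(a, b))

ratio = norm(sub(p, a)) / norm(sub(b, p))   # AP / PB
cos_poa = dot(p, a) / (norm(p) * la)        # cosine of angle between OP and OA
cos_pob = dot(p, b) / (norm(p) * lb)        # cosine of angle between OP and OB
print(ratio, la / lb, cos_poa, cos_pob)
```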

1.3 THE SCALAR PRODUCT a • b 

Definition 1.8 Scalar product Suppose the vectors a and b have representations OA
and OB. Then the scalar product a · b of a and b is defined by

a · b = |a||b| cos θ, (1.3)

where θ is the angle between OA and OB. [Note that a · b is a scalar quantity.]


* One can always take a special point of the figure as origin. The penalty is that the symmetry of the 
labelling is lost. 
Laws of algebra for the scalar product

(i) b · a = a · b (commutative law)

(ii) a · (b + c) = a · b + a · c (distributive law)

(iii) (λa) · b = λ(a · b) (associative with scalar multiplication)


Properties of the scalar product

(i) a · a = |a|².

(ii) The scalar product a · b = 0 if (and only if) a and b are perpendicular (or one of
them is zero).

(iii) If {i, j, k} is an orthonormal basis then

i · i = j · j = k · k = 1, i · j = j · k = k · i = 0.

(iv) If a₁ = λ₁i + μ₁j + ν₁k and a₂ = λ₂i + μ₂j + ν₂k then

a₁ · a₂ = λ₁λ₂ + μ₁μ₂ + ν₁ν₂.

Example 1.5 Numerical example on the scalar product 

If a = 2i − j + 2k and b = 4i − 3k, find the magnitudes of a and b and the angle
between them.

Solution 

|a|² = a · a = (2i − j + 2k) · (2i − j + 2k) = 2² + (−1)² + 2² = 9. Hence |a| = 3.
Similarly |b|² = 4² + 0² + (−3)² = 25 so that |b| = 5. Also a · b = 8 + 0 + (−6) = 2.
Since a · b = |a||b| cos θ, it follows that 2 = 3 × 5 × cos θ so that cos θ = 2/15.

Hence the magnitudes of a and b are 3 and 5, and the angle between them is
cos⁻¹(2/15). ■
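
The same calculation can be carried out in a few lines of code. This is a minimal sketch using only the component formula in property (iv); the function name `dot` and the tuple representation are illustrative assumptions.

```python
import math

def dot(u, v):
    """Scalar product in an orthonormal basis: sum of products of components."""
    return sum(x * y for x, y in zip(u, v))

a = (2, -1, 2)    # a = 2i - j + 2k
b = (4, 0, -3)    # b = 4i - 3k

mag_a = math.sqrt(dot(a, a))
mag_b = math.sqrt(dot(b, b))
cos_theta = dot(a, b) / (mag_a * mag_b)
theta = math.acos(cos_theta)    # angle in radians

print(mag_a, mag_b, cos_theta, math.degrees(theta))
```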

Example 1.6 Apollonius’s theorem 

In the triangle OAB, M is the mid-point of AB. Show that (OA)² + (OB)² =
2(OM)² + 2(AM)².

Solution 

Let the vertex O be the origin of vectors and let the position vectors of A and B be a
and b. Then the position vector of M is ½(a + b). Then

4(OM)² = |a + b|² = (a + b) · (a + b)

= a · a + b · b + 2a · b = |a|² + |b|² + 2a · b

FIGURE 1.8 The component of v in the
direction of the unit vector n is equal to OV′,
the projection of OV onto the line through
O parallel to n.


and

4(AM)² = (AB)² = |a − b|² = (a − b) · (a − b)

= a · a + b · b − 2a · b = |a|² + |b|² − 2a · b.

Hence

2(OM)² + 2(AM)² = |a|² + |b|² = (OA)² + (OB)²

as required. ■


Components of a vector 

Definition 1.9 Components of a vector Let n be a unit vector. Then the component
of the vector v in the direction of n is defined to be v · n. The component of v in the
direction of a general vector a is therefore v · â, where â = a/|a|.

Properties of components 

(i) The component v · n has a simple geometrical significance. Let OV be a representation
of v as shown in Figure 1.8. Then

v · n = |v||n| cos θ = OV cos θ = OV′,

where OV′ is the projection of OV onto the line through O parallel to n.

(ii) Suppose that v is a sum of vectors, v = v₁ + v₂ + v₃ say. Then the component of
v in the direction of n is

v · n = (v₁ + v₂ + v₃) · n = (v₁ · n) + (v₂ · n) + (v₃ · n),

by the distributive law for the scalar product. Thus, the component of the sum of a
number of vectors in a given direction is equal to the sum of the components of the
individual vectors in that direction.

(iii) If a vector v is expanded in terms of a general basis set {a, b, c} in the form v =
λa + μb + νc, the coefficients λ, μ, ν are not the components of the vector v in the

FIGURE 1.9 The vector product
a × b = (|a||b| sin θ) n.



directions of a, b, c. However if v is expanded in terms of an orthonormal basis set
{i, j, k} in the form v = λi + μj + νk, then the component of v in the i-direction
is

v · i = (λi + μj + νk) · i = λ(i · i) + μ(j · i) + ν(k · i)
= λ + 0 + 0 = λ.

Similarly μ and ν are the components of v in the j- and k-directions. Hence when
a vector v is expanded in terms of an orthonormal basis set {i, j, k} in the form
v = λi + μj + νk, the coefficients λ, μ, ν are the components of v in the i-, j-
and k-directions.

Example 1.7 Numerical example on components 

If v = 6i − 3j + 15k and a = 2i − j − 2k, find the component of v in the direction
of a.

Solution 

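|a|² = a · a = 2² + (−1)² + (−2)² = 9. Hence |a| = 3 and

\[
\hat{\mathbf{a}} = \frac{\mathbf{a}}{|\mathbf{a}|} = \frac{2\mathbf{i} - \mathbf{j} - 2\mathbf{k}}{3}.
\]

The required component of v is therefore

\[
\mathbf{v} \cdot \hat{\mathbf{a}}
= (6\mathbf{i} - 3\mathbf{j} + 15\mathbf{k}) \cdot \left(\frac{2\mathbf{i} - \mathbf{j} - 2\mathbf{k}}{3}\right)
= \frac{12 + 3 - 30}{3} = -5. \qquad ■
\]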
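
In code, the component of v in the direction of a is simply (v · a)/|a|. The sketch below repeats the calculation of this example; the function names are illustrative assumptions.

```python
import math

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def component(v, a):
    """Component of v in the direction of a, that is v . a_hat = (v . a)/|a|."""
    return dot(v, a) / math.sqrt(dot(a, a))

v = (6, -3, 15)    # v = 6i - 3j + 15k
a = (2, -1, -2)    # a = 2i - j - 2k

comp = component(v, a)
print(comp)
```

A negative component, as here, means that v makes an obtuse angle with a.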


1.4 THE VECTOR PRODUCT a × b

Definition 1.10 Vector product Suppose the vectors a and b have representations OA
and OB and let n be the unit vector perpendicular to the plane OAB and such that
{a, b, n} is a right-handed set. Then the vector product a × b of a and b is defined by

a × b = (|a||b| sin θ) n, (1.4)

where θ (0 ≤ θ ≤ 180°) is the angle between OA and OB. [Note that a × b is a vector
quantity.]


Laws of algebra for the vector product 

(i) b × a = −a × b (anti-commutative law)

(ii) a × (b + c) = a × b + a × c (distributive law)

(iii) (λa) × b = λ(a × b) (associative with scalar multiplication)

Since the vector product is anti-commutative, the order of the terms in vector products
must be preserved. The vector product is not associative.


Properties of the vector product 

(i) a × a = 0.

(ii) The vector product a × b = 0 if (and only if) a and b are parallel (or one of them is
zero).

(iii) If {i, j, k} is a standard basis then

i × j = k, k × i = j, j × k = i, i × i = j × j = k × k = 0.

(iv) If a₁ = λ₁i + μ₁j + ν₁k and a₂ = λ₂i + μ₂j + ν₂k then

\[
\mathbf{a}_1 \times \mathbf{a}_2 =
\begin{vmatrix}
\mathbf{i} & \mathbf{j} & \mathbf{k} \\
\lambda_1 & \mu_1 & \nu_1 \\
\lambda_2 & \mu_2 & \nu_2
\end{vmatrix},
\]

where the determinant is to be evaluated by the first row.


Example 1.8 Numerical example on vector product 


If a = 2i − j + 2k and b = −i − 3k, find a unit vector perpendicular to both a and
b.


Solution 

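The vector a × b is perpendicular to both a and b. Now

\[
\mathbf{a} \times \mathbf{b} =
\begin{vmatrix}
\mathbf{i} & \mathbf{j} & \mathbf{k} \\
2 & -1 & 2 \\
-1 & 0 & -3
\end{vmatrix}
= (3 - 0)\,\mathbf{i} - \big((-6) - (-2)\big)\,\mathbf{j} + (0 - 1)\,\mathbf{k}
= 3\mathbf{i} + 4\mathbf{j} - \mathbf{k}.
\]

The magnitude of this vector is (3² + 4² + (−1)²)^{1/2} = (26)^{1/2}. Hence the required
unit vector can be either of ±(3i + 4j − k)/(26)^{1/2}. ■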
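
The determinant expansion translates directly into a component formula for the vector product. The following sketch, in which the function names are illustrative assumptions, recomputes a × b for this example and confirms that the result is perpendicular to both a and b.

```python
import math

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    """Vector product in a standard basis, from the first-row expansion
    of the determinant in property (iv)."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

a = (2, -1, 2)     # a = 2i - j + 2k
b = (-1, 0, -3)    # b = -i - 3k

c = cross(a, b)
length = math.sqrt(dot(c, c))
unit = tuple(x / length for x in c)    # one of the two possible unit vectors

print(c, dot(c, a), dot(c, b), unit)
```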
1.5 TRIPLE PRODUCTS

Triple products are not new operations but are simply one product followed by
another. There are two kinds of triple product whose values are scalar and vector respectively.

Triple scalar product 

An expression of the form a · (b × c) is called a triple scalar product; its value is a scalar.

Properties of the triple scalar product

(i)

a · (b × c) = c · (a × b) = b · (c × a), (1.5)

that is, cyclic permutation of the vectors a, b, c in a triple scalar product leaves its
value unchanged. [Interchanging two vectors reverses the sign.]

This formula can alternatively be written

a · (b × c) = (a × b) · c, (1.6)

that is, interchanging the positions of the ‘dot’ and the ‘cross’ in a triple scalar
product leaves its value unchanged.

Because of this symmetry, the triple scalar product can be denoted unambiguously
by [a, b, c].

(ii) The triple scalar product [a, b, c] = 0 if (and only if) a, b, c are coplanar (or one
of them is zero). In particular a triple scalar product is zero if two of its vectors are
the same.

(iii) If [a, b, c] > 0 then the set {a, b, c} is right-handed. If [a, b, c] < 0 then the set
{a, b, c} is left-handed.

(iv) If a₁ = λ₁i + μ₁j + ν₁k, a₂ = λ₂i + μ₂j + ν₂k, a₃ = λ₃i + μ₃j + ν₃k, where
{i, j, k} is a standard basis, then

\[
[\mathbf{a}_1, \mathbf{a}_2, \mathbf{a}_3] =
\begin{vmatrix}
\lambda_1 & \mu_1 & \nu_1 \\
\lambda_2 & \mu_2 & \nu_2 \\
\lambda_3 & \mu_3 & \nu_3
\end{vmatrix}.
\tag{1.7}
\]
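
Properties (i) and (iii) can be tried out on sample vectors. The sketch below is illustrative only: the function names and sample components are assumptions, and the triple scalar product is computed as a · (b × c) rather than by expanding the determinant (1.7).

```python
def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def triple(a, b, c):
    """Triple scalar product [a, b, c] = a . (b x c)."""
    return dot(a, cross(b, c))

def handedness(a, b, c):
    """Classify the ordered set {a, b, c} by the sign of [a, b, c]."""
    t = triple(a, b, c)
    if t > 0:
        return "right-handed"
    if t < 0:
        return "left-handed"
    return "coplanar"

i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)
a, b, c = (1, 2, 0), (0, 1, 3), (2, 0, 1)    # arbitrary sample vectors

print(triple(a, b, c), triple(c, a, b), triple(b, c, a))   # cyclic permutations
print(triple(a, c, b))                                     # one interchange
print(handedness(i, j, k), handedness(j, i, k), handedness(a, b, a))
```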


Triple vector product 

An expression of the form a × (b × c) is called a triple vector product; its value is a vector.

Property of the triple vector product

Since b × c is perpendicular to both b and c, it follows that a × (b × c) must lie in the
same plane as b and c. It can therefore be expanded in the form λb + μc. The actual
formula is

a × (b × c) = (a · c) b − (a · b) c. (1.8)

Since the vector product is anti-commutative and non-associative, it is wise to use this
formula exactly as it stands.

Example 1.9 Using triple products 

Expand the expression (a × b) · (c × d) in terms of scalar products.

Solution 

Use the triple scalar product formula (1.6) to interchange the first ‘dot’ and ‘cross’,
and then expand the resulting triple vector product by the formula (1.8), as follows:

(a × b) · (c × d) = a · [b × (c × d)] = a · [(b · d) c − (b · c) d]

= (a · c)(b · d) − (a · d)(b · c). ■
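
Identities of this kind are easy to spot-check with integer components, for which the arithmetic is exact. The sketch below tests both the triple vector product formula (1.8) and the expansion just obtained; the sample vectors and helper names are illustrative assumptions.

```python
def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def combine(lam, u, mu, v):
    """Linear combination lam*u + mu*v."""
    return tuple(lam * x + mu * y for x, y in zip(u, v))

a, b, c, d = (1, 2, 0), (0, 1, 3), (2, 0, 1), (1, -1, 2)

# Formula (1.8): a x (b x c) = (a . c) b - (a . b) c
lhs_18 = cross(a, cross(b, c))
rhs_18 = combine(dot(a, c), b, -dot(a, b), c)

# Example 1.9: (a x b) . (c x d) = (a . c)(b . d) - (a . d)(b . c)
lhs_19 = dot(cross(a, b), cross(c, d))
rhs_19 = dot(a, c) * dot(b, d) - dot(a, d) * dot(b, c)

print(lhs_18, rhs_18, lhs_19, rhs_19)
```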


1.6 VECTOR FUNCTIONS OF A SCALAR VARIABLE 


In practice, the value of a vector quantity often depends on a scalar variable such
as the time t. For example, if A is the label of a particle moving through space, then its
position vector a (relative to a fixed origin O) will vary with time, that is, a = a(t). The
vector a is therefore a function of the scalar variable t.

The time dependence of a vector need not involve motion. The value of the electric
or magnetic field at a fixed point* of space will generally vary with time so that E = E(t)
and B = B(t). More generally, the scalar variable need not be the time. Consider the
space curve C shown in Figure 1.10, whose points are parametrised by the parameter α.
Each point of the curve has a unique tangent line whose direction can be characterised by
the unit vector t. This is called the unit tangent vector to C and it depends on α, that is,
t = t(α). In this case the independent variable is the scalar α and (just to confuse matters)
the dependent variable is the vector t.


Differentiation 

The most important operation that can be carried out on a vector function of a scalar 
variable is differentiation. 


Definition 1.11 Differentiation of vectors Suppose that the vector v is a function of
the scalar variable α, that is, v = v(α). Then the derivative of the function v(α) with
respect to α is defined by the limit†

\[
\frac{d\mathbf{v}}{d\alpha} = \lim_{\Delta\alpha \to 0}
\left( \frac{\mathbf{v}(\alpha + \Delta\alpha) - \mathbf{v}(\alpha)}{\Delta\alpha} \right).
\tag{1.9}
\]


* We will not be concerned here with vector functions of position. These are called vector fields.

† Mathematical note: The statement u(α) → U as α → A means that |u(α) − U| → 0 as α → A.
This looks identical to the definition of the derivative of an ordinary real function, but there
is a difference. When α changes to α + Δα, the function v changes from v(α) to v(α + Δα),
a difference of v(α + Δα) − v(α). However, this ‘difference’ now means vector subtraction
and its value is a vector; it remains a vector after dividing by the scalar increment Δα.
Hence dv/dα, the limit of this quotient as Δα → 0, is a vector. Furthermore, since
dv/dα depends on α, it is itself a vector function of the scalar variable α. The rules for
differentiating combinations of vector functions are similar to those for ordinary scalar
functions.


Differentiation rules for vector functions 

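Let u(α) and v(α) be vector functions of the scalar variable α, and let λ(α) be a
scalar function. Then:

(i) \( \dfrac{d}{d\alpha}(\mathbf{u} + \mathbf{v}) = \dot{\mathbf{u}} + \dot{\mathbf{v}} \)

(ii) \( \dfrac{d}{d\alpha}(\lambda \mathbf{u}) = \dot{\lambda}\,\mathbf{u} + \lambda\,\dot{\mathbf{u}} \)

(iii) \( \dfrac{d}{d\alpha}(\mathbf{u} \cdot \mathbf{v}) = \dot{\mathbf{u}} \cdot \mathbf{v} + \mathbf{u} \cdot \dot{\mathbf{v}} \)

(iv) \( \dfrac{d}{d\alpha}(\mathbf{u} \times \mathbf{v}) = \dot{\mathbf{u}} \times \mathbf{v} + \mathbf{u} \times \dot{\mathbf{v}} \)

where \( \dot{\mathbf{u}} \) means du/dα and so on. Note that the order of the terms in the vector
product formula must be preserved.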


Example 1.10 Differentiating vector functions 

(i) The position vector of a particle P at time t is given by

r = (2t² − 5t) i + (4t + 2) j + t³ k,

where {i, j, k} is a constant basis set. Find dr/dt and d²r/dt². (These are the
velocity and acceleration vectors of P at time t.)

(ii) If a = a(t) and b is a constant vector, show that

\[
\frac{d}{dt}\big[\mathbf{a} \cdot (\dot{\mathbf{a}} \times \mathbf{b})\big]
= \mathbf{a} \cdot (\ddot{\mathbf{a}} \times \mathbf{b}).
\]

Solution 

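(i) Since i, j, k are constant vectors, it follows from the differentiation rules that

\[
\frac{d\mathbf{r}}{dt} = (4t - 5)\,\mathbf{i} + 4\,\mathbf{j} + 3t^2\,\mathbf{k},
\qquad
\frac{d^2\mathbf{r}}{dt^2} = 4\,\mathbf{i} + 6t\,\mathbf{k}.
\]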

FIGURE 1.10 The unit tangent vector t(α) at a typical point A on the curve C,
defined parametrically by r = r(α).


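(ii)

\[
\frac{d}{dt}\big[\mathbf{a} \cdot (\dot{\mathbf{a}} \times \mathbf{b})\big]
= \dot{\mathbf{a}} \cdot (\dot{\mathbf{a}} \times \mathbf{b})
+ \mathbf{a} \cdot \frac{d}{dt}(\dot{\mathbf{a}} \times \mathbf{b})
= 0 + \mathbf{a} \cdot (\ddot{\mathbf{a}} \times \mathbf{b} + \dot{\mathbf{a}} \times \dot{\mathbf{b}})
= \mathbf{a} \cdot (\ddot{\mathbf{a}} \times \mathbf{b} + \dot{\mathbf{a}} \times \mathbf{0})
= \mathbf{a} \cdot (\ddot{\mathbf{a}} \times \mathbf{b}),
\]

as required. ■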
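
The definition (1.9) suggests a simple numerical check of part (i): replace the limit by a quotient with a small but finite increment. The sketch below uses a symmetric (central) difference, which is a standard numerical variant and not the one-sided quotient of the definition; the function names, the step size and the sample time are illustrative assumptions.

```python
def r(t):
    """Position vector r = (2t^2 - 5t) i + (4t + 2) j + t^3 k, as components."""
    return (2 * t**2 - 5 * t, 4 * t + 2, t**3)

def velocity(t):
    """dr/dt from part (i)."""
    return (4 * t - 5, 4.0, 3 * t**2)

def acceleration(t):
    """d^2r/dt^2 from part (i)."""
    return (4.0, 0.0, 6 * t)

def central_diff(f, t, h=1e-5):
    """Approximate df/dt component by component with a central difference."""
    return tuple((p - m) / (2 * h) for p, m in zip(f(t + h), f(t - h)))

t0 = 1.5
v_num = central_diff(r, t0)          # should be close to velocity(t0)
a_num = central_diff(velocity, t0)   # should be close to acceleration(t0)
print(v_num, velocity(t0))
print(a_num, acceleration(t0))
```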

1.7 TANGENT AND NORMAL VECTORS TO A CURVE 

In the next chapter we will define the velocity and acceleration of a particle moving
in a space of three dimensions. In order to be able to interpret these definitions, we
need to know a little about the differential geometry of curves. In particular, it is useful to
know what the unit tangent and unit normal vectors of a curve are.

Unit tangent vector 

Consider the curve C shown in Figure 1.10 which is defined by the parametric equation
r = r(α). In general this can be a curve in three-dimensional space. Let A be a typical
point of C corresponding to the parameter α and A′ a nearby point corresponding to the
parameter α + Δα. The chord AA′ represents the vector

Δr = r(α + Δα) − r(α)

and so Δr/|Δr| is a unit vector parallel to the chord AA′. The unit tangent vector t(α)
at the point A is defined to be the limit of this expression as A′ → A, that is

\[
\mathbf{t}(\alpha) = \lim_{\Delta\alpha \to 0} \frac{\Delta\mathbf{r}}{|\Delta\mathbf{r}|}.
\]

FIGURE 1.11 The cycloid x = a(θ − sin θ), y = a(1 − cos θ),
z = 0, where 0 < θ < 2π.


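The tangent vector t is related to the derivative dr/dα since

\[
\frac{d\mathbf{r}}{d\alpha}
= \lim_{\Delta\alpha \to 0} \frac{\Delta\mathbf{r}}{\Delta\alpha}
= \lim_{\Delta\alpha \to 0} \frac{\Delta\mathbf{r}}{|\Delta\mathbf{r}|}
\; \lim_{\Delta\alpha \to 0} \frac{|\Delta\mathbf{r}|}{\Delta\alpha}
= \mathbf{t}(\alpha) \lim_{\Delta\alpha \to 0} \left| \frac{\Delta\mathbf{r}}{\Delta\alpha} \right|
= \mathbf{t}(\alpha) \left| \frac{d\mathbf{r}}{d\alpha} \right|,
\]

that is,

\[
\frac{d\mathbf{r}}{d\alpha} = \left| \frac{d\mathbf{r}}{d\alpha} \right| \mathbf{t}(\alpha).
\tag{1.10}
\]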


Example 1.11 Finding the unit tangent vector 

Figure 1.11 shows the cycloid x = a(θ − sin θ), y = a(1 − cos θ), z = 0, where
0 < θ < 2π. Find the unit tangent vector to the cycloid at the point with parameter
θ.


Solution 

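Let i, j be unit vectors in the directions Ox, Oy respectively. Then the vector form
of the equation for the cycloid is

\[
\mathbf{r} = a(\theta - \sin\theta)\,\mathbf{i} + a(1 - \cos\theta)\,\mathbf{j}.
\]

Then

\[
\frac{d\mathbf{r}}{d\theta} = a(1 - \cos\theta)\,\mathbf{i} + (a \sin\theta)\,\mathbf{j}
\]

and

\[
\left| \frac{d\mathbf{r}}{d\theta} \right| = a(2 - 2\cos\theta)^{1/2} = 2a \sin\tfrac{1}{2}\theta.
\]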
Hence the unit tangent vector to the cycloid is

\[
\mathbf{t}(\theta) = \frac{d\mathbf{r}}{d\theta} \bigg/ \left| \frac{d\mathbf{r}}{d\theta} \right|
= (\sin\tfrac{1}{2}\theta)\,\mathbf{i} + (\cos\tfrac{1}{2}\theta)\,\mathbf{j},
\]

after simplification. ■
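
The result can be compared with a direct numerical estimate based on formula (1.10): approximate dr/dθ by a central difference and divide by its magnitude. In the sketch below, the value chosen for the constant a (called `A` in the code), the step size and the sample values of θ are all illustrative assumptions.

```python
import math

A = 2.0   # the constant a of the cycloid (arbitrary sample value)

def r(theta):
    """Cycloid r = a(theta - sin theta) i + a(1 - cos theta) j, as (x, y)."""
    return (A * (theta - math.sin(theta)), A * (1 - math.cos(theta)))

def tangent_numeric(theta, h=1e-6):
    """Unit tangent from (1.10): normalise a central-difference estimate of dr/dtheta."""
    (xp, yp), (xm, ym) = r(theta + h), r(theta - h)
    dx, dy = (xp - xm) / (2 * h), (yp - ym) / (2 * h)
    n = math.hypot(dx, dy)
    return (dx / n, dy / n)

def tangent_formula(theta):
    """Unit tangent found in Example 1.11."""
    return (math.sin(theta / 2), math.cos(theta / 2))

for theta in (0.5, 2.0, 4.0, 6.0):
    print(theta, tangent_numeric(theta), tangent_formula(theta))
```

Since the answer does not involve a, the printed values should be unchanged if `A` is altered.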

The formula (1.10) takes its simplest form when the parameter α is taken to be s, the
distance along the curve measured from some fixed point. In this case,

\[
\left| \frac{d\mathbf{r}}{ds} \right| = \lim_{\Delta s \to 0} \frac{|\Delta\mathbf{r}|}{\Delta s} = 1
\]

so that t (pointing in the direction of increasing s) is given by the simple formula

\[
\mathbf{t} = \frac{d\mathbf{r}}{ds}.
\tag{1.11}
\]

This is the most convenient formula for theoretical purposes.

Unit normal vector 

Let t(s) be the unit tangent vector to the curve C, where the parameter s represents distance
along the curve. Then, since t is a vector function of the scalar variable s, it has a derivative
dt/ds which is another vector function of s.

Since t is a unit vector it follows that t(s) · t(s) = 1 and if we differentiate this identity
with respect to s, we obtain

\[
0 = \frac{d}{ds}(\mathbf{t} \cdot \mathbf{t})
= \frac{d\mathbf{t}}{ds} \cdot \mathbf{t} + \mathbf{t} \cdot \frac{d\mathbf{t}}{ds}
= 2\left( \mathbf{t} \cdot \frac{d\mathbf{t}}{ds} \right).
\]

It follows that dt/ds is always perpendicular to t. It is usual to write dt/ds in the form

\[
\frac{d\mathbf{t}}{ds} = \kappa\,\mathbf{n}
\tag{1.12}
\]

where κ = |dt/ds|, a positive scalar called the curvature, and n is a unit vector called
the (principal) unit normal vector. At each point of the curve, the unit vectors t(s) and
n(s) are mutually perpendicular.

The quantities n and κ have a nice geometrical interpretation. Let A be any point on
the curve and suppose that the distance parameter s is measured from A. Then, by Taylor’s




theorem, the form of the curve C near A is given approximately by

\[
\mathbf{r}(s) = \mathbf{r}(0) + s \left[ \frac{d\mathbf{r}}{ds} \right]_{s=0}
+ \tfrac{1}{2} s^2 \left[ \frac{d^2\mathbf{r}}{ds^2} \right]_{s=0} + O\!\left(s^3\right),
\]

that is,

\[
\mathbf{r}(s) = \mathbf{a} + s\,\mathbf{t} + \left(\tfrac{1}{2}\kappa s^2\right)\mathbf{n} + O\!\left(s^3\right),
\tag{1.13}
\]

where a is the position vector of the point A, and t, κ and n are evaluated at the point A.
Thus, near A, the curve C lies* in the plane through A parallel to the vectors t and n. We
can also see from equation (1.13) that, near A, the curve C is approximately a parabola.
To the same order of approximation, it is equally true that, near A, the curve C is given by

\[
\mathbf{r}(s) = \mathbf{a} + \kappa^{-1}(\sin \kappa s)\,\mathbf{t}
+ \kappa^{-1}(1 - \cos \kappa s)\,\mathbf{n} + O\!\left(s^3\right).
\tag{1.14}
\]

Thus, near A, the curve C is approximately a circle of radius κ⁻¹; the vector t is tangential
to this circle and the vector n points towards its centre. The radius κ⁻¹ is called the radius
of curvature of C at the point A.


Example 1.12 Finding the unit normal vector and curvature 


Find the unit normal vector and curvature of the cycloid x = a(θ − sin θ), y =
a(1 − cos θ), z = 0, where 0 < θ < 2π.


Solution 

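The tangent vector to the cycloid has already been found to be

\[
\mathbf{t}(\theta) = \frac{d\mathbf{r}}{d\theta} \bigg/ \left| \frac{d\mathbf{r}}{d\theta} \right|
= (\sin\tfrac{1}{2}\theta)\,\mathbf{i} + (\cos\tfrac{1}{2}\theta)\,\mathbf{j}.
\]

Hence, by the chain rule,

\[
\frac{d\mathbf{t}}{ds} = \frac{d\mathbf{t}/d\theta}{ds/d\theta}
= \frac{d\mathbf{t}/d\theta}{|d\mathbf{r}/d\theta|}
= \frac{\tfrac{1}{2}(\cos\tfrac{1}{2}\theta)\,\mathbf{i} - \tfrac{1}{2}(\sin\tfrac{1}{2}\theta)\,\mathbf{j}}{2a \sin\tfrac{1}{2}\theta}
= \left(4a \sin\tfrac{1}{2}\theta\right)^{-1}
\left( (\cos\tfrac{1}{2}\theta)\,\mathbf{i} - (\sin\tfrac{1}{2}\theta)\,\mathbf{j} \right).
\]

Hence the unit normal vector and curvature of the cycloid are given by

\[
\mathbf{n}(\theta) = (\cos\tfrac{1}{2}\theta)\,\mathbf{i} - (\sin\tfrac{1}{2}\theta)\,\mathbf{j},
\qquad
\kappa(\theta) = \left(4a \sin\tfrac{1}{2}\theta\right)^{-1}.
\]

The radius of curvature of the cycloid is therefore 4a sin ½θ. ■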
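
The chain-rule calculation above can be mimicked numerically: estimate dt/dθ by a central difference, divide by ds/dθ = |dr/dθ| to obtain dt/ds, and then read off κ and n from (1.12). The sketch below does this for one sample point; the value of the constant a (called `A` in the code), the step size and the function names are illustrative assumptions.

```python
import math

A = 2.0   # the constant a of the cycloid (arbitrary sample value)

def dr(theta):
    """dr/dtheta = a(1 - cos theta) i + (a sin theta) j, as (x, y) components."""
    return (A * (1 - math.cos(theta)), A * math.sin(theta))

def unit_tangent(theta):
    d = dr(theta)
    n = math.hypot(*d)
    return (d[0] / n, d[1] / n)

def curvature_and_normal(theta, h=1e-5):
    """Estimate kappa and n from dt/ds = (dt/dtheta)/|dr/dtheta| = kappa n."""
    tp, tm = unit_tangent(theta + h), unit_tangent(theta - h)
    dt_dtheta = ((tp[0] - tm[0]) / (2 * h), (tp[1] - tm[1]) / (2 * h))
    speed = math.hypot(*dr(theta))                  # ds/dtheta
    dt_ds = (dt_dtheta[0] / speed, dt_dtheta[1] / speed)
    kappa = math.hypot(*dt_ds)
    return kappa, (dt_ds[0] / kappa, dt_ds[1] / kappa)

theta = 2.0
kappa, normal = curvature_and_normal(theta)
kappa_exact = 1 / (4 * A * math.sin(theta / 2))
normal_exact = (math.cos(theta / 2), -math.sin(theta / 2))
print(kappa, kappa_exact, normal, normal_exact)
```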


* More precisely, this plane makes three point contact with the curve C at the point A. 

Problems on Chapter 1

Answers and comments are at the end of the book. 
Harder problems carry a star (*). 


1.1 In terms of the standard basis set {i, j, k}, a = 2i − j − 2k, b = 3i − 4k and c =
i − 5j + 3k.

(i) Find 3a + 2b − 4c and |a − b|².

(ii) Find |a|, |b| and a · b. Deduce the angle between a and b.

(iii) Find the component of c in the direction of a and in the direction of b.

(iv) Find a × b, b × c and (a × b) × (b × c).

(v) Find a · (b × c) and (a × b) · c and verify that they are equal. Is the set {a, b, c} right-
or left-handed?

(vi) By evaluating each side, verify the identity a × (b × c) = (a · c) b − (a · b) c.

Vector geometry 

1.2 Find the angle between any two diagonals of a cube.

1.3 ABCDEF is a regular hexagon with centre O which is also the origin of position vectors.
Find the position vectors of the vertices C, D, E, F in terms of the position vectors a, b of A
and B.

1.4 Let ABCD be a general (skew) quadrilateral and let P, Q, R, S be the mid-points of the
sides AB, BC, CD, DA respectively. Show that PQRS is a parallelogram.

1.5 In a general tetrahedron, lines are drawn connecting the mid-point of each side with the
mid-point of the side opposite. Show that these three lines meet in a point that bisects each of
them.

1.6 Let ABCD be a general tetrahedron and let P, Q, R, S be the median centres of the faces
opposite to the vertices A, B, C, D respectively. Show that the lines AP, BQ, CR, DS all
meet in a point (called the centroid of the tetrahedron), which divides each line in the ratio 3:1.

1.7 A number of particles with masses $m_1, m_2, m_3, \ldots$ are situated at the points with position
vectors $\mathbf{r}_1, \mathbf{r}_2, \mathbf{r}_3, \ldots$ relative to an origin O. The centre of mass G of the particles is defined
to be the point of space with position vector

\[ \mathbf{R} = \frac{m_1\mathbf{r}_1 + m_2\mathbf{r}_2 + m_3\mathbf{r}_3 + \cdots}{m_1 + m_2 + m_3 + \cdots}. \]

Show that if a different origin O' were used, this definition would still place G at the same
point of space.

1.8 Prove that the three perpendiculars of a triangle are concurrent.

[Construct the two perpendiculars from A and B and take their intersection point as O, the origin of position
vectors. Then prove that OC must be perpendicular to AB.]
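Vector algebra

1.9 If $\mathbf{a}_1 = \lambda_1\mathbf{i} + \mu_1\mathbf{j} + \nu_1\mathbf{k}$, $\mathbf{a}_2 = \lambda_2\mathbf{i} + \mu_2\mathbf{j} + \nu_2\mathbf{k}$, $\mathbf{a}_3 = \lambda_3\mathbf{i} + \mu_3\mathbf{j} + \nu_3\mathbf{k}$, where {i, j, k}
is a standard basis, show that

\[ \mathbf{a}_1 \cdot (\mathbf{a}_2 \times \mathbf{a}_3) = \begin{vmatrix} \lambda_1 & \mu_1 & \nu_1 \\ \lambda_2 & \mu_2 & \nu_2 \\ \lambda_3 & \mu_3 & \nu_3 \end{vmatrix}. \]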

Deduce that cyclic rotation of the vectors in a triple scalar product leaves the value of the 
product unchanged. 


1.10 By expressing the vectors a, b, c in terms of a suitable standard basis, prove the identity
a × (b × c) = (a · c) b − (a · b) c.


1.11 Prove the identities


(i) (a × b) · (c × d) = (a · c)(b · d) − (a · d)(b · c)

(ii) (a × b) × (c × d) = [a, b, d] c − [a, b, c] d

(iii) a × (b × c) + c × (a × b) + b × (c × a) = 0 (Jacobi’s identity)


1.12 Reciprocal basis Let {a, b, c} be any basis set. Then the corresponding reciprocal
basis {a*, b*, c*} is defined by

\[ \mathbf{a}^* = \frac{\mathbf{b}\times\mathbf{c}}{[\mathbf{a},\mathbf{b},\mathbf{c}]}, \qquad \mathbf{b}^* = \frac{\mathbf{c}\times\mathbf{a}}{[\mathbf{a},\mathbf{b},\mathbf{c}]}, \qquad \mathbf{c}^* = \frac{\mathbf{a}\times\mathbf{b}}{[\mathbf{a},\mathbf{b},\mathbf{c}]}. \]

(i) If {i, j, k} is a standard basis, show that {i*, j*, k*} = {i, j, k}.

(ii) Show that [a*, b*, c*] = 1/[a, b, c]. Deduce that if {a, b, c} is a right-handed set then
so is {a*, b*, c*}.

(iii) Show that {(a*)*, (b*)*, (c*)*} = {a, b, c}.

(iv) If a vector v is expanded in terms of the basis set {a, b, c} in the form

v = λa + μb + νc,

show that the coefficients λ, μ, ν are given by λ = v · a*, μ = v · b*, ν = v · c*.


1.13 Lamé’s equations The directions in which X-rays are strongly scattered by a crystal are
determined from the solutions x of Lamé’s equations, namely

x · a = L, x · b = M, x · c = N,

where {a, b, c} are the basis vectors of the crystal lattice, and L, M, N are any integers. Show
that the solutions of Lamé’s equations are

x = L a* + M b* + N c*,

where {a*, b*, c*} is the reciprocal basis to {a, b, c}.
Differentiation of vectors

1.14 If r(t) = (3t² − 4) i + t³ j + (t + 3) k, where {i, j, k} is a constant standard basis, find
$\dot{\mathbf{r}}$ and $\ddot{\mathbf{r}}$. Deduce the time derivative of $\mathbf{r}\times\dot{\mathbf{r}}$.

1.15 The vector v is a function of the time t and k is a constant vector. Find the time
derivatives of (i) $|\mathbf{v}|^2$, (ii) $(\mathbf{v}\cdot\mathbf{k})\,\mathbf{v}$, (iii) $[\mathbf{v}, \dot{\mathbf{v}}, \mathbf{k}]$.

1.16 Find the unit tangent vector, the unit normal vector and the curvature of the circle x =
a cos θ, y = a sin θ, z = 0 at the point with parameter θ.

1.17 Find the unit tangent vector, the unit normal vector and the curvature of the helix x =
a cos θ, y = a sin θ, z = bθ at the point with parameter θ.

1.18 Find the unit tangent vector, the unit normal vector and the curvature of the parabola
x = ap², y = 2ap, z = 0 at the point with parameter p.
Chapter Two


Velocity, acceleration 
and scalar angular velocity 


KEY FEATURES 

The key concepts in this chapter are the velocity and acceleration of a particle and the angular 
velocity of a rigid body in planar motion. 


Kinematics is the study of the motion of material bodies without regard to the forces
that cause their motion. The subject does not seek to answer the question of why bodies
move as they do; that is the province of dynamics. It merely provides a geometrical
description of the possible motions. The basic building block for bodies in mechanics is
the particle, an idealised body that occupies only a single point of space. The important
kinematical quantities in the motion of a particle are its velocity and acceleration.
We begin with the simple case of straight line particle motion, where velocity and
acceleration are scalars, and then progress to three-dimensional motion, where velocity and
acceleration are vectors.

The other important idealisation that we consider is the rigid body, which we regard
as a collection of particles linked by a light rigid framework. The important kinematical
quantity in the motion of a rigid body is its angular velocity. In this chapter, we consider
only those rigid body motions that are essentially two-dimensional, so that angular
velocity is a scalar quantity. The general three-dimensional case is treated in Chapter 16.

2.1 STRAIGHT LINE MOTION OF A PARTICLE 

Consider a particle P moving along the x-axis so that its displacement x from the
origin O is a known function of the time t. Then the mean velocity of P over the time
interval $t_1 < t < t_2$ is defined to be the increase in the displacement of P divided by the
time taken, that is,

\[ \frac{x(t_2) - x(t_1)}{t_2 - t_1}. \qquad (2.1) \]

FIGURE 2.1 The particle P moves in a straight line and
has displacement x and velocity v at time t.


Example 2.1 Mean velocity 

Suppose the displacement of P from O at time t is given by x = t² − 6t, where x
is measured in metres and t in seconds. Find the mean velocity of P over the time
interval 1 < t < 3.


Solution

In this case, x(1) = −5 and x(3) = −9 so that the mean velocity of P is ((−9) −
(−5))/(3 − 1) = −2 m s⁻¹. ■
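
The calculation in Example 2.1 is a direct application of formula (2.1), and can be sketched in a few lines of Python. The function name `mean_velocity` is an illustrative choice, not notation from the text.

```python
def mean_velocity(x, t1, t2):
    """Mean velocity over t1 < t < t2: increase in displacement / time taken (formula (2.1))."""
    return (x(t2) - x(t1)) / (t2 - t1)

def x(t):
    """Displacement from Example 2.1, in metres, with t in seconds."""
    return t**2 - 6 * t

print(mean_velocity(x, 1, 3))  # prints -2.0 (metres per second)
```

Passing the displacement as a function keeps the formula independent of the particular motion, so the same helper applies to any x(t).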

The mean velocity of a particle is less important to us than its instantaneous velocity,
that is, its velocity at a given instant in time. We cannot find the instantaneous velocity of
P at time $t_1$ merely by letting $t_2 = t_1$ in the formula (2.1), since the quotient would then
be undefined. However, we can define the instantaneous velocity as the limit of the mean
velocity as the time interval tends to zero, that is, as $t_2 \to t_1$. Thus $v(t_1)$, the instantaneous
velocity of P at time $t_1$, can be defined by

\[ v(t_1) = \lim_{t_2 \to t_1} \left( \frac{x(t_2) - x(t_1)}{t_2 - t_1} \right). \]

But this is precisely the definition of dx/dt, the derivative of x with respect to t, evaluated
at $t = t_1$. This leads us to the official definition:

Definition 2.1 1-D velocity The (instantaneous) velocity v of P, in the positive
x-direction, is defined by

\[ v = \frac{dx}{dt}. \qquad (2.2) \]

The speed of P is defined to be the rate of increase of the total distance travelled and is
therefore equal to |v|.
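
The limiting process behind Definition 2.1 can be watched numerically. The sketch below reuses the displacement x = t² − 6t of Example 2.1 and shrinks the interval starting at t₁ = 1; the helper names and the chosen step sizes are illustrative assumptions.

```python
def x(t):
    """Displacement x = t^2 - 6t (metres), as in Example 2.1."""
    return t**2 - 6 * t

def mean_velocity(x, t1, t2):
    """Mean velocity over the interval from t1 to t2, formula (2.1)."""
    return (x(t2) - x(t1)) / (t2 - t1)

def velocity(t):
    """Instantaneous velocity dx/dt = 2t - 6, obtained by differentiating x by hand."""
    return 2 * t - 6

t1 = 1.0
for h in (1.0, 0.1, 0.01, 0.001):
    # As h shrinks, the mean velocity approaches dx/dt at t1.
    print(h, mean_velocity(x, t1, t1 + h), velocity(t1))
```

For this quadratic displacement the mean velocity over an interval of length h starting at t₁ = 1 is exactly −4 + h, so the printed values close in on dx/dt = −4 as h decreases.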

Similarly, the acceleration of P, the rate of increase of v, is defined as follows:

Definition 2.2 1-D acceleration The (instantaneous) acceleration a of P, in the
positive x-direction, is defined by

\[ a = \frac{dv}{dt} = \frac{d^2x}{dt^2}. \qquad (2.3) \]


Example 2.2 Finding rectilinear velocity and acceleration 

Suppose the displacement of P from O at time t is given by x = t³ − 6t² + 4, where
x is measured in metres and t in seconds. Find the velocity and acceleration of P at
FIGURE 2.2 The particle P moves in three-dimensional
space and, relative to the reference frame $\mathcal{F}$ and origin O,
has position vector r at time t.


2.2 GENERAL MOTION OF A PARTICLE 

When a particle P moves in two or three-dimensional space, its position can be
described by its vector displacement r from an origin O that is fixed in a rigid reference
frame $\mathcal{F}$. Whether $\mathcal{F}$ is moving or not is irrelevant here; the position vector r is simply
measured relative to $\mathcal{F}$. Figure 2.2 shows a particle P moving in three-dimensional space
with position vector r (relative to the reference frame $\mathcal{F}$) at time t.

Question Reference frames

What is a reference frame and why do we need one?

Answer

A rigid reference frame $\mathcal{F}$ is essentially a rigid body whose particles can be labelled
to create reference points. The most familiar such body is the Earth. Relative to
a single particle, the only thing that can be specified is distance from that particle.
However, relative to a rigid body, one can specify both distance and direction. Thus
the value of any vector quantity can be specified relative to $\mathcal{F}$. In particular, if we
label some particle O of the body as origin, we can specify the position of any point
of space by its position vector relative to the frame $\mathcal{F}$ and the origin O.

The specification of vectors relative to a reference frame is much simplified if we
introduce a Cartesian coordinate system. This can be done in infinitely many different
ways. Imagine that $\mathcal{F}$ is extended by a set of three mutually orthogonal planes that are
rigidly embedded in it. The coordinates x, y, z of a point P are then the distances of P
from these three planes. Let O be the origin of this coordinate system, and {i, j, k}
its unit vectors. We can then conveniently refer to the frame $\mathcal{F}$, together with the
embedded coordinate system Oxyz, by the notation $\mathcal{F}\{O;\, \mathbf{i}, \mathbf{j}, \mathbf{k}\}$. ■

In general motion, the velocity and acceleration of a particle are vector quantities and 
are defined by: 


Definition 2.3 3-D velocity and acceleration The velocity v and acceleration a of P
are defined by

\[ \mathbf{v} = \frac{d\mathbf{r}}{dt} \qquad \text{and} \qquad \mathbf{a} = \frac{d\mathbf{v}}{dt}. \qquad (2.4) \]


Connection with the rectilinear case 

The scalar velocity and acceleration defined in section 2.1 for the case of straight line motion are simply
related to the corresponding vector quantities defined above. It would be possible to use the vector
formalism in all cases but, for the case of straight line motion along the x-axis, r, v and a would have the
form

r = x i, v = v i, a = a i,

where v = dx/dt and a = dv/dt. It is therefore sufficient to work with the scalar quantities x, v and a;
use of the vector formalism would be clumsy and unnecessary.

Example 2.4 Finding 3-D velocity and acceleration

Relative to the reference frame $\mathcal{F}\{O;\, \mathbf{i}, \mathbf{j}, \mathbf{k}\}$, the position vector of a particle P at
time t is given by

r = (2t² − 3) i + (4t + 4) j + (t³ + 2t²) k.

Find (i) the distance OP when t = 0, (ii) the velocity of P when t = 1, (iii) the
acceleration of P when t = 2.

Solution

In this solution we will make use of the rules for differentiation of sums and products
involving vector functions of the time. These rules are listed in section 1.6.

(i) When t = 0, r = −3i + 4j so that OP = |r| = 5.

(ii) Relative to the reference frame $\mathcal{F}$, the unit vectors {i, j, k} are constant and so
their time derivatives are zero. The velocity v of P is therefore

v = dr/dt = 4t i + 4 j + (3t² + 4t) k.

When t = 1, v = 4i + 4j + 7k.

(iii) Relative to the reference frame $\mathcal{F}$, the acceleration a of P is

a = dv/dt = 4 i + (6t + 4) k.

When t = 2, a = 4i + 16k. ■
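
The results of Example 2.4 can be checked numerically. In the sketch below the vectors are stored as plain (i, j, k) component tuples, and the hand-computed velocity is compared with a central-difference estimate of dr/dt; the helper names and the step size h are illustrative assumptions.

```python
import math

def position(t):
    """Components of r = (2t^2 - 3) i + (4t + 4) j + (t^3 + 2t^2) k."""
    return (2 * t**2 - 3, 4 * t + 4, t**3 + 2 * t**2)

def velocity(t):
    """Components of v = dr/dt, differentiated by hand."""
    return (4 * t, 4, 3 * t**2 + 4 * t)

def acceleration(t):
    """Components of a = dv/dt, differentiated by hand."""
    return (4, 0, 6 * t + 4)

def norm(vec):
    """Magnitude of a vector given by its components."""
    return math.sqrt(sum(c * c for c in vec))

def central_difference(f, t, h=1e-5):
    """Numerical estimate of the derivative of a vector function f at time t."""
    return tuple((p - m) / (2 * h) for p, m in zip(f(t + h), f(t - h)))

print(norm(position(0)))                          # distance OP at t = 0
print(velocity(1), central_difference(position, 1.0))
print(acceleration(2), central_difference(velocity, 2.0))
```

Differentiating component by component is legitimate here only because the basis vectors {i, j, k} are constant in the chosen frame, as noted in the solution.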

Interpretation of the vectors v and a 

The velocity vector v has a simple interpretation. Suppose that s is the arc-length travelled 
by P, measured from some fixed point of its path, and that s is increasing with time.* 


* The arguments that follow assume a familiarity with the unit tangent and normal vectors to a general 
curve, as described in section 1.7 
\[ \frac{d\hat{\mathbf{r}}}{d\theta} = \hat{\boldsymbol{\theta}}, \qquad \frac{d\hat{\boldsymbol{\theta}}}{d\theta} = -\hat{\mathbf{r}}. \qquad (2.8) \]


Suppose now that P is a moving particle with polar co-ordinates r, θ that are functions
of the time t. The position vector of P relative to O has magnitude OP = r and direction
$\hat{\mathbf{r}}$ and can therefore be written

\[ \mathbf{r} = r\,\hat{\mathbf{r}}. \qquad (2.9) \]

In what follows, one must distinguish carefully between the position vector $\mathbf{r}$, which is
the vector $\overrightarrow{OP}$, the co-ordinate r, which is the distance OP, and the polar unit vector $\hat{\mathbf{r}}$.

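To obtain the polar formula for the velocity of P, we differentiate formula (2.9) with
respect to t. This gives

\[ \mathbf{v} = \frac{d\mathbf{r}}{dt} = \frac{d}{dt}\left(r\,\hat{\mathbf{r}}\right) = \frac{dr}{dt}\,\hat{\mathbf{r}} + r\,\frac{d\hat{\mathbf{r}}}{dt} \qquad (2.10) \]

\[ \phantom{\mathbf{v}} = \dot{r}\,\hat{\mathbf{r}} + r\,\frac{d\hat{\mathbf{r}}}{dt}. \qquad (2.11) \]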


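We will use the dot notation for time derivatives throughout this section; $\dot{r}$ means dr/dt,
$\dot{\theta}$ means dθ/dt, $\ddot{r}$ means d²r/dt² and $\ddot{\theta}$ means d²θ/dt².

Now $\hat{\mathbf{r}}$ is a function of θ which is, in its turn, a function of t. Hence, by the chain rule
and formula (2.8),

\[ \frac{d\hat{\mathbf{r}}}{dt} = \frac{d\hat{\mathbf{r}}}{d\theta}\,\frac{d\theta}{dt} = \dot{\theta}\,\hat{\boldsymbol{\theta}}. \]

If we now substitute this formula into equation (2.11) we obtain

\[ \mathbf{v} = \dot{r}\,\hat{\mathbf{r}} + (r\dot{\theta})\,\hat{\boldsymbol{\theta}}, \qquad (2.12) \]

which is the polar formula for the velocity of P.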

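To obtain the polar formula for acceleration, we differentiate the velocity formula
(2.12) with respect to t. This gives*

\[
\begin{aligned}
\mathbf{a} = \frac{d\mathbf{v}}{dt} &= \frac{d}{dt}\left(\dot{r}\,\hat{\mathbf{r}}\right) + \frac{d}{dt}\left((r\dot{\theta})\,\hat{\boldsymbol{\theta}}\right) \\
&= \ddot{r}\,\hat{\mathbf{r}} + \dot{r}\,\frac{d\hat{\mathbf{r}}}{dt} + (\dot{r}\dot{\theta} + r\ddot{\theta})\,\hat{\boldsymbol{\theta}} + (r\dot{\theta})\,\frac{d\hat{\boldsymbol{\theta}}}{dt} \\
&= \ddot{r}\,\hat{\mathbf{r}} + \dot{r}\left(\frac{d\hat{\mathbf{r}}}{d\theta}\,\frac{d\theta}{dt}\right) + (\dot{r}\dot{\theta} + r\ddot{\theta})\,\hat{\boldsymbol{\theta}} + (r\dot{\theta})\left(\frac{d\hat{\boldsymbol{\theta}}}{d\theta}\,\frac{d\theta}{dt}\right) \\
&= \ddot{r}\,\hat{\mathbf{r}} + (\dot{r}\dot{\theta})\,\hat{\boldsymbol{\theta}} + (\dot{r}\dot{\theta} + r\ddot{\theta})\,\hat{\boldsymbol{\theta}} - (r\dot{\theta}^2)\,\hat{\mathbf{r}} \\
&= \left(\ddot{r} - r\dot{\theta}^2\right)\hat{\mathbf{r}} + \left(r\ddot{\theta} + 2\dot{r}\dot{\theta}\right)\hat{\boldsymbol{\theta}},
\end{aligned}
\]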


* Be a hero. Obtain this formula yourself without looking at the text. 
which is the polar formula for the acceleration of P. These results are summarised
below:


Polar formulae for velocity and acceleration

If a particle is moving in a plane and has polar coordinates r, θ at time t, then its
velocity and acceleration vectors are given by

\[ \mathbf{v} = \dot{r}\,\hat{\mathbf{r}} + (r\dot{\theta})\,\hat{\boldsymbol{\theta}}, \qquad (2.13) \]

\[ \mathbf{a} = \left(\ddot{r} - r\dot{\theta}^2\right)\hat{\mathbf{r}} + \left(r\ddot{\theta} + 2\dot{r}\dot{\theta}\right)\hat{\boldsymbol{\theta}}. \qquad (2.14) \]


The formula (2.13) shows that the velocity of P is the vector sum of an outward radial
velocity $\dot{r}$ and a transverse velocity $r\dot{\theta}$; in other words v is just the sum of the velocities
that P would have if r and θ varied separately. This is not true for the acceleration as it will
be observed that adding together the separate accelerations would not yield the term $2\dot{r}\dot{\theta}\,\hat{\boldsymbol{\theta}}$.
This ‘Coriolis term’ is certainly present however, but is difficult to interpret intuitively.
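
As a sanity check on the polar acceleration formula (2.14), including the Coriolis term, one can compare it with a direct numerical second derivative of the Cartesian coordinates x = r cos θ, y = r sin θ. The sketch below is only an illustration: the test motion r = 1 + t², θ = t² is hypothetical, and it assumes the usual Cartesian expressions for the polar unit vectors, namely the radial one (cos θ, sin θ) and the transverse one (−sin θ, cos θ).

```python
import math

def polar_acceleration(r, r_dot, r_ddot, theta_dot, theta_ddot):
    """Radial and transverse acceleration components from formula (2.14)."""
    a_r = r_ddot - r * theta_dot**2
    a_theta = r * theta_ddot + 2 * r_dot * theta_dot   # includes the Coriolis term
    return a_r, a_theta

def cartesian_position(t):
    """Hypothetical test motion r = 1 + t^2, theta = t^2, in Cartesian form."""
    r, theta = 1 + t**2, t**2
    return r * math.cos(theta), r * math.sin(theta)

def cartesian_acceleration_fd(t, h=1e-4):
    """Second central difference of the Cartesian coordinates."""
    xm, ym = cartesian_position(t - h)
    x0, y0 = cartesian_position(t)
    xp, yp = cartesian_position(t + h)
    return (xp - 2 * x0 + xm) / h**2, (yp - 2 * y0 + ym) / h**2

t = 0.7
r, theta = 1 + t**2, t**2
# Hand-computed derivatives of the test motion: r' = 2t, r'' = 2, theta' = 2t, theta'' = 2.
a_r, a_theta = polar_acceleration(r, 2 * t, 2.0, 2 * t, 2.0)
# Resolve the polar components along i and j.
ax = a_r * math.cos(theta) - a_theta * math.sin(theta)
ay = a_r * math.sin(theta) + a_theta * math.cos(theta)
print((ax, ay), cartesian_acceleration_fd(t))
```

If the Coriolis term were omitted from `a_theta`, the two results would no longer agree, which is one concrete way to see that the term really is present.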

Example 2.6 Velocity and acceleration in polar coordinates 

A particle sliding along a radial groove in a rotating turntable has polar coordinates at
time t given by

r = ct, θ = Ωt,

where c and Ω are positive constants. Find the velocity and acceleration vectors of
the particle at time t and find the speed of the particle at time t.

Deduce that, for t > 0, the angle between the velocity and acceleration vectors is
always acute.

Solution

From the polar formulae (2.13), (2.14) for velocity and acceleration, we obtain

\[ \mathbf{v} = c\,\hat{\mathbf{r}} + (ct)\Omega\,\hat{\boldsymbol{\theta}} = c\left(\hat{\mathbf{r}} + \Omega t\,\hat{\boldsymbol{\theta}}\right) \]

and

\[ \mathbf{a} = \left(0 - (ct)\Omega^2\right)\hat{\mathbf{r}} + (0 + 2c\Omega)\,\hat{\boldsymbol{\theta}} = c\Omega\left(-\Omega t\,\hat{\mathbf{r}} + 2\,\hat{\boldsymbol{\theta}}\right). \]

The speed of the particle at time t is thus given by $|\mathbf{v}| = c\left(1 + \Omega^2 t^2\right)^{1/2}$.
To find the angle between v and a, consider

\[ \mathbf{v}\cdot\mathbf{a} = c^2\Omega\left(-\Omega t + 2\Omega t\right) = c^2\Omega^2 t > 0 \]

for t > 0. Hence, for t > 0, the angle between v and a is acute. ■
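
The algebra of Example 2.6 can be confirmed by evaluating the polar formulae for particular numbers. In the sketch below, the values chosen for c, Ω and t are arbitrary illustrations, and vectors are held as (radial, transverse) component pairs; since the two polar unit vectors are orthonormal, the dot product is just the sum of products of components.

```python
import math

def groove_motion(c, omega, t):
    """Polar components of v and a for r = c t, theta = omega t (formulae (2.13), (2.14))."""
    r, r_dot, r_ddot = c * t, c, 0.0
    theta_dot, theta_ddot = omega, 0.0
    v = (r_dot, r * theta_dot)
    a = (r_ddot - r * theta_dot**2, r * theta_ddot + 2 * r_dot * theta_dot)
    return v, a

c, omega, t = 0.5, 3.0, 2.0           # illustrative values only
v, a = groove_motion(c, omega, t)
speed = math.hypot(*v)
v_dot_a = v[0] * a[0] + v[1] * a[1]

print(speed, c * math.sqrt(1 + omega**2 * t**2))   # both give the speed
print(v_dot_a, c**2 * omega**2 * t)                # both give v . a, positive for t > 0
```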
General circular motion

An important application of polar coordinates is to circular motion. We have already
considered the special case of uniform circular motion, but now we suppose that P moves
in any manner (not necessarily with constant speed) around a circle with centre O and
radius b. If we take O to be the origin of polar coordinates, the condition r = b implies
that $\dot{r} = \ddot{r} = 0$ and the formula (2.13) for the velocity of P reduces to

\[ \mathbf{v} = (b\dot{\theta})\,\hat{\boldsymbol{\theta}}. \qquad (2.15) \]

This result is depicted in Figure 2.5. The transverse velocity component $b\dot{\theta}$ (which is not
necessarily the speed of P since $\dot{\theta}$ may be negative) is called the circumferential velocity
of P. Circumferential velocity will be important when we study the motion of a rigid
body rotating about a fixed axis; in this case, each particle of the rigid body moves on a
circular path.

The corresponding formula for the acceleration of P is

\[
\begin{aligned}
\mathbf{a} &= \left(0 - b\dot{\theta}^2\right)\hat{\mathbf{r}} + \left(b\ddot{\theta} + 0\right)\hat{\boldsymbol{\theta}} \\
&= -\left(b\dot{\theta}^2\right)\hat{\mathbf{r}} + \left(b\ddot{\theta}\right)\hat{\boldsymbol{\theta}} \\
&= -\left(\frac{v^2}{b}\right)\hat{\mathbf{r}} + \dot{v}\,\hat{\boldsymbol{\theta}},
\end{aligned}
\]

where v is the circumferential velocity $b\dot{\theta}$. These results are summarised below:


General circular motion 

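Suppose a particle P moves in any manner around the circle r = b, where r, θ are
plane polar coordinates. Then the velocity and acceleration vectors of P are given
by

\[ \mathbf{v} = v\,\hat{\boldsymbol{\theta}}, \qquad (2.16) \]

\[ \mathbf{a} = -\left(\frac{v^2}{b}\right)\hat{\mathbf{r}} + \dot{v}\,\hat{\boldsymbol{\theta}}, \qquad (2.17) \]

where v (= $b\dot{\theta}$) is the circumferential velocity of P.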



The formula (2.17) shows that, in general circular motion, the acceleration of P is the
(vector) sum of an inward radial acceleration v²/b and a transverse acceleration $\dot{v}$. This
is consistent with the general formula (2.5). Indeed, what the formula (2.5) says is that,
when P moves along a completely general path, its acceleration vector is the same as if it
were moving on the circle of curvature at each point of its path.

FIGURE 2.5 The particle P moves on the circle with centre O
and radius b. At time t its angular displacement is θ and its
circumferential velocity is $b\dot{\theta}$.

Example 2.7 Pendulum motion 

The bob of a certain pendulum moves on a vertical circle of radius b and, when the
string makes an angle θ with the downward vertical, the circumferential velocity v of
the bob is given by

v² = 2gb cos θ,

where g is a positive constant. Find the acceleration of the bob when the string makes
angle θ with the downward vertical.

Solution

From the acceleration formula (2.17), we have

\[ \mathbf{a} = -\left(\frac{v^2}{b}\right)\hat{\mathbf{r}} + \dot{v}\,\hat{\boldsymbol{\theta}} = -(2g\cos\theta)\,\hat{\mathbf{r}} + \dot{v}\,\hat{\boldsymbol{\theta}}. \]

It remains to express $\dot{v}$ in terms of θ. On differentiating the formula v² = 2gb cos θ
with respect to t, we obtain

\[ 2v\dot{v} = -(2gb\sin\theta)\,\dot{\theta}, \]

and, since $b\dot{\theta} = v$, we find that

\[ \dot{v} = -g\sin\theta. \]

Hence the acceleration of the bob when the string makes angle θ with the downward
vertical is

\[ \mathbf{a} = -(2g\cos\theta)\,\hat{\mathbf{r}} - (g\sin\theta)\,\hat{\boldsymbol{\theta}}. \; ■ \]
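
The step from v² = 2gb cos θ to the transverse acceleration −g sin θ can be checked numerically. The sketch below differentiates v² with respect to θ by central differences and uses the chain rule together with the relation between circumferential velocity and angular rate, which gives (transverse acceleration) = (1/2b) d(v²)/dθ wherever v is non-zero. The numerical values of g, b and θ are illustrative assumptions.

```python
import math

def v_squared(g, b, theta):
    """Square of the circumferential velocity, as given in Example 2.7."""
    return 2 * g * b * math.cos(theta)

def transverse_acceleration(g, b, theta, h=1e-6):
    """Rate of change of v, from 2 v v' = d(v^2)/dtheta * theta' and b theta' = v."""
    d_v2 = (v_squared(g, b, theta + h) - v_squared(g, b, theta - h)) / (2 * h)
    return d_v2 / (2 * b)

def bob_acceleration(g, b, theta):
    """(radial, transverse) components of the bob's acceleration via formula (2.17)."""
    return -v_squared(g, b, theta) / b, transverse_acceleration(g, b, theta)

g, b, theta = 9.81, 1.5, 0.4           # illustrative values only
print(bob_acceleration(g, b, theta))
print((-2 * g * math.cos(theta), -g * math.sin(theta)))   # result of Example 2.7
```

Note that the radius b drops out of both components, in agreement with the final formula of the example.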


2.4 RIGID BODY ROTATING ABOUT A FIXED AXIS 

Some objects that we find in everyday life, such as a brick or a thick steel rod,
are so difficult to deform that their shape is virtually unchangeable. We model such an
object by a rigid body, a collection of particles forming a perfectly rigid framework. Any
motion of the rigid body must maintain this framework.

FIGURE 2.6 The rigid body $\mathcal{B}$ rotates about the fixed axis Oz and has angular
displacement θ at time t. Each particle P of $\mathcal{B}$ moves on a circular path; the point
$P_0$ is the reference position of P.

An important type of rigid body motion is rotation about a fixed axis; a spinning fan,
a door opening on its hinges and a playground roundabout are among the many examples
of this type of motion. Suppose $\mathcal{B}$ is a rigid body which is constrained to rotate about
the fixed axis Oz as shown in Figure 2.6. (This means that the particles of $\mathcal{B}$ that lie on
Oz are held fixed. Rotation about Oz is then the only motion of $\mathcal{B}$ consistent with
rigidity.) At time t, $\mathcal{B}$ has angular displacement θ measured from some reference position.
The angular displacement θ is the rotational counterpart of the Cartesian displacement x
of a particle in straight line motion. By analogy with the rectilinear case, we make the
following definitions:

Definition 2.4 Angular velocity The angular velocity ω of $\mathcal{B}$ is defined to be ω =
dθ/dt and the absolute value of ω is called the angular speed of $\mathcal{B}$.

Units. Angular velocity (and angular speed) are measured in radians per second (rad s⁻¹).

Example 2.8 Spinning crankshaft 1 

The crankshaft of a motorcycle engine is spinning at 6000 revolutions per minute. 
What is its angular speed in S.I. units? 

Solution 

6000 revolutions per minute is 100 revolutions per second which is 200π radians per
second. This is the angular speed in S.I. units. ■

Particle velocities in a rotating rigid body 

In rotational motion about a fixed axis, each particle P of $\mathcal{B}$ moves on a circle of some
radius ρ, where ρ is the (fixed) perpendicular distance of P from the rotation axis. It then
follows from (2.16) that the circumferential velocity v of P is given by $\rho\dot{\theta}$, that is

\[ v = \omega\rho. \qquad (2.18) \]
Example 2.9 Spinning crankshaft 2

In the crankshaft example above, find the speed of a particle of the crankshaft that
has perpendicular distance 5 cm from the rotation axis. Find also the magnitude of its
acceleration.

Solution

In this case, |ω| = 200π rad s⁻¹ and ρ = 1/20 m so that the particle speed (the magnitude of
the circumferential velocity v) is 10π ≈ 31.4 m s⁻¹.

Since the circumferential velocity is constant, |a| = v²/ρ = (10π)²/0.05 ≈
20 000 m s⁻², which is about two thousand times the value of the Earth’s gravitational
acceleration! ■
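
The arithmetic of the two crankshaft examples is easy to reproduce. The sketch below converts revolutions per minute to radians per second, then applies v = ωρ and |a| = v²/ρ; the function name and the value 9.81 m s⁻² used for the gravitational acceleration are assumptions made for the comparison.

```python
import math

def rpm_to_rad_per_s(rpm):
    """Convert revolutions per minute to radians per second."""
    return rpm * 2 * math.pi / 60

omega = rpm_to_rad_per_s(6000)     # angular speed of the crankshaft, rad/s
rho = 0.05                         # perpendicular distance from the axis, metres
speed = omega * rho                # circumferential speed, formula (2.18)
accel = speed**2 / rho             # magnitude of the (purely radial) acceleration
g = 9.81                           # assumed value of gravitational acceleration, m/s^2

print(omega, speed, accel, accel / g)
```

The acceleration comes out at a little under 2 × 10⁴ m s⁻², that is, roughly two thousand times g.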

2.5 RIGID BODY IN PLANAR MOTION 

We now consider a more general form of rigid body motion called planar motion. 

Definition 2.5 Planar motion A rigid body $\mathcal{B}$ is said to be in planar motion if each
particle of $\mathcal{B}$ moves in a fixed plane and all these planes are parallel to each other.

Planar motion is quite common. For instance, any flat-bottomed rigid body sliding on a
flat table is in planar motion. Another example is a circular cylinder rolling on a rough flat
table.

The particle velocities in planar motion can be calculated by the following method;
the proof is given in Chapter 16. First select some particle C of the body as the reference
particle. The velocity of a general particle P of the body is then the vector sum of

(i) a translational contribution equal to the velocity of C (as if the body did not
rotate) and

(ii) a rotational contribution (as if C were fixed and the body were rotating with
angular velocity ω about a fixed axis through C).

This result is illustrated in Figure 2.7, where the body is a rectangular plate and the
reference particle C is at a corner of the plate. The velocity $\mathbf{v}$ of P is given by $\mathbf{v} = \mathbf{v}^C + \mathbf{v}^R$,
where the translational contribution $\mathbf{v}^C$ is the velocity of C and the rotational contribution
$\mathbf{v}^R$ is caused by the angular velocity ω about C. Although the reference particle can be
any particle of the body, it is usually taken to be the centre of mass or centre of symmetry
of the body.

Example 2.10 The rolling wheel 

A circular wheel of radius b rolls in a straight line with speed u on a fixed horizontal
table. Find the velocities of its particles.

Solution

This is an instance of planar motion and so the particle velocities can be found by
the method above. Let the position of the wheel at some instant be that shown in
Figure 2.8. The reference particle C is taken to be the centre of the wheel, and the
wheel is supposed to have some angular velocity ω about C. The velocity $\mathbf{v}^P$ of a
typical particle P is then the sum of the two velocities shown. In terms of the vectors
{i, j},

\[ \mathbf{v}^P = u\,\mathbf{i} + \omega\rho\left(\cos\theta\,\mathbf{i} - \sin\theta\,\mathbf{j}\right) = (u + \omega\rho\cos\theta)\,\mathbf{i} - (\omega\rho\sin\theta)\,\mathbf{j}. \qquad (2.19) \]

FIGURE 2.7 The velocity of the particle P belonging to
the rigid body $\mathcal{B}$ is the sum of the translational
contribution $\mathbf{v}^C$ and the rotational contribution $\mathbf{v}^R$. The
reference particle C can be any particle of the body.

FIGURE 2.8 The circular wheel rolls from left to right on a fixed horizontal
table. The reference particle C is taken to be the centre of the wheel and
the velocity of a typical particle P is the sum of the two velocities shown.

In particular, on taking ρ = b and θ = π, the velocity $\mathbf{v}^Q$ of the contact particle Q is
given by

\[ \mathbf{v}^Q = (u - \omega b)\,\mathbf{i}. \qquad (2.20) \]

If the wheel is allowed to slip as it moves across the table, there is no restriction on
$\mathbf{v}^Q$ so that u and ω are unrelated. But rolling, by definition, requires that

\[ \mathbf{v}^Q = \mathbf{0}. \qquad (2.21) \]

On applying this rolling condition to our formula (2.20) for $\mathbf{v}^Q$, we find that ω must
be related to u by

\[ \omega = \frac{u}{b}, \qquad (2.22) \]

and on using this value of ω in (2.19) we find that the velocity of the typical particle
P is given by

\[ \mathbf{v}^P = u\left(1 + \frac{\rho}{b}\cos\theta\right)\mathbf{i} - u\left(\frac{\rho}{b}\sin\theta\right)\mathbf{j}. \qquad (2.23) \]

When P lies on the circumference of the wheel, this formula simplifies to

\[ \mathbf{v}^P = u\,(1 + \cos\theta)\,\mathbf{i} - u\sin\theta\,\mathbf{j}, \qquad (2.24) \]

in which case the speed of P is given by

\[ |\mathbf{v}^P| = 2u\cos(\theta/2), \qquad (-\pi \le \theta \le \pi). \]

Thus the highest particle of the wheel has the largest speed, 2u, while the contact
particle has speed zero, as we already know. ■
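
The rolling-wheel velocities are simple to tabulate. The sketch below evaluates formula (2.23) for a particle at distance ρ from the centre and angle θ, and confirms the two special cases mentioned at the end of the example; the function name and the values of u and b are illustrative assumptions.

```python
import math

def rolling_velocity(u, b, rho, theta):
    """(i, j) components of the velocity of a particle of a rolling wheel, formula (2.23)."""
    omega = u / b                                   # rolling condition (2.22)
    vx = u + omega * rho * math.cos(theta)          # translation plus rotation, i-component
    vy = -omega * rho * math.sin(theta)             # rotation only, j-component
    return vx, vy

u, b = 3.0, 0.4                                     # illustrative speed (m/s) and radius (m)
top = rolling_velocity(u, b, b, 0.0)                # highest particle: theta = 0
contact = rolling_velocity(u, b, b, math.pi)        # contact particle: theta = pi
rim = rolling_velocity(u, b, b, 1.0)                # a general point on the rim

print(math.hypot(*top), 2 * u)                      # largest speed is 2u
print(math.hypot(*contact))                         # zero, up to rounding error
print(math.hypot(*rim), 2 * u * math.cos(0.5))      # speed 2u cos(theta/2) on the rim
```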


2.6 REFERENCE FRAMES IN RELATIVE MOTION 

A reference frame is simply a rigid coordinate system that can be used to specify 
the positions of points in space. In practice it is convenient to regard a reference frame 
as being embedded in, or attached to, some rigid body. The most familiar case is that in 
which the rigid body is the Earth but it could instead be a moving car, or an orbiting space 
station. In principle, any event, the motion of an aircraft for example, can be observed 
from any of these reference frames and the motion will appear different to each observer. 
It is this difference that we now investigate. 

Let the motion of a particle P be observed from the reference frames $\mathcal{F}\{O;\, \mathbf{i}, \mathbf{j}, \mathbf{k}\}$
and $\mathcal{F}'\{O';\, \mathbf{i}, \mathbf{j}, \mathbf{k}\}$ as shown in Figure 2.9. Here we are supposing that the frame $\mathcal{F}'$
does not rotate relative to $\mathcal{F}$. This is why, without losing generality, we can suppose that
$\mathcal{F}$ and $\mathcal{F}'$ have the same set of unit vectors {i, j, k}. For example, P could be an aircraft,
$\mathcal{F}$ could be attached to the Earth, and $\mathcal{F}'$ could be attached to a car driving along a straight
road.

Then r, r′, the position vectors of P relative to $\mathcal{F}$, $\mathcal{F}'$, are connected by

\[ \mathbf{r} = \mathbf{r}' + \mathbf{D}, \qquad (2.25) \]

where D is the position vector of O′ relative to $\mathcal{F}$.
FIGURE 2.9 The particle P is observed from the two reference frames $\mathcal{F}$ and $\mathcal{F}'$.

We now differentiate this equation with respect to t, a step that requires some care.
Let us consider the rates of change of the vectors in equation (2.25), as observed from the
frame $\mathcal{F}$. Then

\[ \mathbf{v} = \left(\frac{d\mathbf{r}'}{dt}\right)_{\mathcal{F}} + \mathbf{V}, \qquad (2.26) \]

where v is the velocity of P observed in $\mathcal{F}$ and V is the velocity of $\mathcal{F}'$ relative to $\mathcal{F}$.

Now when two different reference frames are used to observe the same vector, the
observed rates of change of that vector will generally be different. In particular, it is not
generally true that

\[ \left(\frac{d\mathbf{r}'}{dt}\right)_{\mathcal{F}} = \left(\frac{d\mathbf{r}'}{dt}\right)_{\mathcal{F}'}. \]

However, as we will show in Chapter 17, these two rates of change are equal if the frame
$\mathcal{F}'$ does not rotate relative to $\mathcal{F}$. Hence, in our case, we do have

\[ \left(\frac{d\mathbf{r}'}{dt}\right)_{\mathcal{F}} = \left(\frac{d\mathbf{r}'}{dt}\right)_{\mathcal{F}'} = \mathbf{v}', \]

where v′ is the velocity of P observed in $\mathcal{F}'$.
Equation (2.26) can then be written

\[ \mathbf{v} = \mathbf{v}' + \mathbf{V}. \qquad (2.27) \]

Thus the velocity of P observed in $\mathcal{F}$ is the sum of the velocity of P observed in $\mathcal{F}'$ and
the velocity of the frame $\mathcal{F}'$ relative to $\mathcal{F}$. This result applies only when $\mathcal{F}'$ does not
rotate relative to $\mathcal{F}$.
This is the well known rule for handling ‘relative velocities’. In the aircraft example,
it means that the true velocity of the aircraft (relative to the ground) is the vector sum of 
(i) the velocity of the aircraft relative to the car, and (ii) the velocity of the car relative to 
the road. 

Example 2.11 Relative velocity 

The Mississippi river is a mile wide and has a uniform flow. A steamboat sailing at 
full speed takes 12 minutes to cover a mile when sailing upstream, but only 3 minutes 
when sailing downstream. What is the shortest time in which the steamboat can cross 
the Mississippi to the nearest point on the opposite bank? 

Solution 

The way to handle this problem is to view the motion of the boat from a reference
frame $\mathcal{F}'$ moving with the river. In this reference frame the water is at rest and the
boat sails with the same speed in all directions. The relative velocity formula (2.27)
then gives us the true picture of the motion of the boat relative to the river bank, which
is the reference frame $\mathcal{F}$.

Let $u^B$ be the speed of the boat in still water and $u^R$ be the speed of the river, both
measured in miles per hour. The upstream and downstream times are just a sneaky
way of telling us the values of $u^B$ and $u^R$. When the boat sails downstream, (2.27)
implies that its speed relative to the bank is $u^B + u^R$. But this speed is stated to be
1/3 mile per minute (or 20 miles per hour). Hence

\[ u^B + u^R = 20. \]

Similarly the upstream speed is $u^B - u^R$ and is stated to be 1/12 mile per minute (or
5 miles per hour). Hence

\[ u^B - u^R = 5. \]

Solving these equations yields

\[ u^B = 12.5 \text{ mph}, \qquad u^R = 7.5 \text{ mph}. \]

Now the boat must cross the river. In order to cross by a straight line path to the
nearest point on the opposite bank, the boat’s velocity (relative to the water) must be
directed at some angle α to the required path (as shown in Figure 2.10) so that its
resultant velocity is perpendicular to the stream. For this to be true, α must satisfy

\[ u^B \sin\alpha = u^R, \]

which gives sin α = 3/5. The resultant speed of the boat when crossing the river is
therefore $u^B\cos\alpha$ = 12.5 × (4/5) = 10 mph. Since the river is one mile wide, the
time taken for the crossing is 1/10 hour = 6 minutes. ■

FIGURE 2.10 (panels: Downstream, Upstream, Across) The river flows from left to right with
speed $u^R$ and the boat sails with speed $u^B$ relative to the river. In each case the velocity of
the boat relative to the bank is the vector sum of the two velocities shown.

The relative velocity formula (2.27) can be differentiated again with respect to t to give
a similar connection between accelerations. The result is that

\[ \mathbf{a} = \mathbf{a}' + \mathbf{A}, \qquad (2.28) \]

where a and a′ are the accelerations of P relative to the frames $\mathcal{F}$ and $\mathcal{F}'$ respectively,
and A is the acceleration of the frame $\mathcal{F}'$ relative to the frame $\mathcal{F}$. Once again, this result
applies only when $\mathcal{F}'$ does not rotate relative to $\mathcal{F}$.

Mutually unaccelerated frames 

An important special case of equation (2.28) occurs when the frame $\mathcal{F}'$ is moving with
constant velocity (and no rotation) relative to $\mathcal{F}$. We will then say that $\mathcal{F}$ and $\mathcal{F}'$ are
mutually unaccelerated frames. In this case A = 0 and (2.28) becomes

\[ \mathbf{a} = \mathbf{a}'. \qquad (2.29) \]

This means that when mutually unaccelerated frames are used to observe the motion of a
particle P, the observed acceleration of P is the same in each frame.

This result will be vital in our discussion of inertial frames in Chapter 3. 


Problems on Chapter 2 

Answers and comments are at the end of the book. 

Harder problems carry a star (*). 

Rectilinear particle motion 

2.1 A particle P moves along the x-axis with its displacement at time t given by x = 6t² −
t³ + 1, where x is measured in metres and t in seconds. Find the velocity and acceleration of
P at time t. Find the times at which P is at rest and find its position at these times.

2.2 A particle P moves along the x-axis with its acceleration a at time t given by

a = 6t − 4 m s⁻².

Initially P is at the point x = 20 m and is moving with speed 15 m s⁻¹ in the negative
x-direction. Find the velocity and displacement of P at time t. Find when P comes to rest and
its displacement at this time.
2.3 Constant acceleration formulae A particle P moves along the x-axis with constant
acceleration a in the positive x-direction. Initially P is at the origin and is moving with velocity
u in the positive x-direction. Show that the velocity v and displacement x of P at time t are
given by*

\[ v = u + at, \qquad x = ut + \tfrac{1}{2}at^2, \]

and deduce that

\[ v^2 = u^2 + 2ax. \]

In a standing quarter mile test, the Suzuki Bandit 1200 motorcycle covered the quarter
mile (from rest) in 11.4 seconds and crossed the finish line doing 116 miles per hour. Are
these figures consistent with the assumption of constant acceleration?

General particle motion 

2.4 The trajectory of a charged particle moving in a magnetic field is given by

\mathbf{r} = b \cos \Omega t \, \mathbf{i} + b \sin \Omega t \, \mathbf{j} + c t \, \mathbf{k},

where b, Ω and c are positive constants. Show that the particle moves with constant speed and
find the magnitude of its acceleration.

2.5 Acceleration due to rotation and orbit of the Earth A body is at rest at a location on the
Earth’s equator. Find its acceleration due to the Earth’s rotation. [Take the Earth’s radius at
the equator to be 6400 km.]

Find also the acceleration of the Earth in its orbit around the Sun. [Take the Sun to be
fixed and regard the Earth as a particle following a circular path with centre the Sun and radius
15 × 10^{10} m.]

2.6 An insect flies on a spiral trajectory such that its polar coordinates at time t are given by

r = b e^{\Omega t}, \qquad \theta = \Omega t,

where b and Ω are positive constants. Find the velocity and acceleration vectors of the insect
at time t, and show that the angle between these vectors is always π/4.

2.7 A racing car moves on a circular track of radius b. The car starts from rest and its speed
increases at a constant rate α. Find the angle between its velocity and acceleration vectors at
time t.


* These are the famous constant acceleration formulae. Although they are a mainstay of school
mechanics, we will make little use of them since, in most of the problems that we treat, the
acceleration is not constant. It is a serious offence to use these formulae in non-constant
acceleration problems.
2.8 A particle P moves on a circle with centre O and radius b. At a certain instant the speed
of P is v and its acceleration vector makes an angle α with PO. Find the magnitude of the
acceleration vector at this instant.

2.9* A bee flies on a trajectory such that its polar coordinates at time t are given by

r = \frac{b t}{\tau^2} (2\tau - t), \qquad \theta = \frac{t}{\tau} \qquad (0 < t < 2\tau),

where b and τ are positive constants. Find the velocity vector of the bee at time t.

Show that the least speed achieved by the bee is b/τ. Find the acceleration of the bee at
this instant.

2.10* A pursuit problem: Daniel and the Lion The luckless Daniel (D) is thrown into a
circular arena of radius a containing a lion (L). Initially the lion is at the centre O of the
arena while Daniel is at the perimeter. Daniel’s strategy is to run with his maximum speed u
around the perimeter. The lion responds by running at its maximum speed U in such a way
that it remains on the (moving) radius OD. Show that r, the distance of L from O, satisfies
the differential equation

\dot{r}^2 = \frac{u^2}{a^2} \left( \frac{U^2 a^2}{u^2} - r^2 \right).

Find r as a function of t. If U > u, show that Daniel will be caught, and find how long this
will take.

Show that the path taken by the lion is an arc of a circle. For the special case in which
U = u, sketch the path taken by the lion and find the point of capture.

2.11 General motion with constant speed A particle moves along any path in three-dimensional
space with constant speed. Show that its velocity and acceleration vectors must
always be perpendicular to each other. [Hint. Differentiate the formula v · v = v^2 with respect
to t.]

2.12 A particle P moves so that its position vector r satisfies the differential equation

\dot{\mathbf{r}} = \mathbf{c} \times \mathbf{r},

where c is a constant vector. Show that P moves with constant speed on a circular path. [Hint.
Take the dot product of the equation first with c and then with r.]

Angular velocity 

2.13 A large truck with double rear wheels has a brick jammed between two of its tyres which
are 4 ft in diameter. If the truck is travelling at 60 mph, find the maximum speed of the brick
and the magnitude of its acceleration. [Express the acceleration as a multiple of g = 32 ft s^{-2}.]

2.14 A particle is sliding along a smooth radial groove in a circular turntable which is rotating
with constant angular speed Ω. The distance of the particle from the rotation axis at time t is
observed to be

r = b \cosh \Omega t

for t > 0, where b is a positive constant. Find the speed of the particle (relative to a fixed
reference frame) at time t, and find the magnitude and direction of the acceleration.

FIGURE 2.12 Crank and piston mechanism

2.15 Figure 2.11 shows an eccentric circular cam of radius b rotating with constant angular
velocity ω about a fixed pivot O which is a distance e from the centre C. The cam drives a
valve which slides in a straight guide. Find the maximum speed and maximum acceleration of
the valve.

2.16 Figure 2.12 shows a piston driving a crank OP pivoted at the end O. The piston slides
in a straight cylinder and the crank is made to rotate with constant angular velocity ω. Find the
distance OQ in terms of the lengths b, c and the angle θ. Show that, when b/c is small, OQ
is given approximately by

OQ = c + b \cos\theta - \frac{b^2}{2c} \sin^2\theta,

on neglecting (b/c)^4 and higher powers. Using this approximation, find the maximum
acceleration of the piston.


2.17 Figure 2.13 shows an epicyclic gear arrangement in which the ‘sun’ gear Q_1 of radius
b_1 and the ‘ring’ gear Q_2 of inner radius b_2 rotate with angular velocities ω_1, ω_2 respectively
about their fixed common centre O. Between them they grip the ‘planet’ gear Q, whose centre
C moves on a circle centre O. Find the circumferential velocity of C and the angular velocity
of the planet gear Q. If O and C were connected by an arm pivoted at O, what would be the
angular velocity of the arm?

FIGURE 2.13 Epicyclic gear mechanism

FIGURE 2.14 The pins P and Q at the ends of a rigid link move along the axes OX, OY
respectively.

2.18 Figure 2.14 shows a straight rigid link of length a whose ends contain pins P, Q that
are constrained to move along the axes OX, OY. The displacement x of the pin P at time t
is prescribed to be x = b sin Ωt, where b and Ω are positive constants with b < a. Find the
angular velocity ω and the speed of the centre C of the link at time t.

Relative velocity 

2.19 An aircraft is to fly from a point A to an airfield B 600 km due north of A. If a steady 
wind of 90 km/h is blowing from the north-west, find the direction the plane should be pointing 
and the time taken to reach B if the cruising speed of the aircraft in still air is 200 km/h. 

2.20 An aircraft takes off from a horizontal runway with constant speed U, climbing at a
constant angle a to the horizontal. A car is moving on the runway with constant speed u 
directly towards the front of the aircraft. The car is distance a from the aircraft at the instant 
of take-off. Find the distance of closest approach of the car and aircraft. [Don’t try this one at 
home.] 

2.21* An aircraft has cruising speed v and a flying range (out and back) of R_0 in still air.
Show that, in a north wind of speed u (u < v) its range in a direction whose true bearing from
north is θ is given by

\frac{R_0 (v^2 - u^2)}{v (v^2 - u^2 \sin^2\theta)^{1/2}}.

What is the maximum value of this range and in what directions is it attained?

FIGURE 2.15 The dog D chases the hare H by running directly towards the hare’s current
position.

Computer assisted problems 

2.22 Dog chasing a hare; another pursuit problem. Figure 2.15 shows a dog with position
vector r^D and velocity v^D chasing a hare with position vector r^H and velocity v^H. The dog’s
strategy is to run directly towards the current position of the hare. Given the motion of the hare
and the speed of the dog, what path does the dog follow?

Since the dog runs directly towards the hare, its velocity v^D must satisfy

\frac{\mathbf{v}^D}{v^D} = -\frac{\mathbf{r}^D - \mathbf{r}^H}{|\mathbf{r}^D - \mathbf{r}^H|}.

In terms of the position vector of the dog relative to the hare, given by R = r^D - r^H, this equation becomes

\dot{\mathbf{R}} = -\frac{\mathbf{R}}{R} \, v^D - \mathbf{v}^H.

Given the velocity v^H of the hare and the speed v^D of the dog as functions of time, this differential equation
determines the trajectory of the dog relative to the hare; capture occurs when R = 0. The actual trajectory
of the dog is given by r^D = R + r^H.

If the motion takes place in a plane with R = X i + Y j then X and Y satisfy the coupled differential
equations

\dot{X} = -\frac{v^D X}{(X^2 + Y^2)^{1/2}} - v^H_x, \qquad \dot{Y} = -\frac{v^D Y}{(X^2 + Y^2)^{1/2}} - v^H_y,

together with initial conditions of the form X(0) = x_0 and Y(0) = y_0. Such equations cannot usually
be solved analytically but are extremely easy to solve with computer assistance. Two interesting cases to
consider are as follows. In each case the speeds of the dog and the hare are constants.

(i) Initially the hare is at the origin with the dog at some point (x_0, y_0). The hare then runs along the
positive x-axis and is chased by the dog. Show that the hare gets caught if v^D > v^H, but when v^D = v^H
the dog always misses (unless he starts directly in the path of the hare). This remarkable result can be
proved analytically.

(ii) The hare runs in a circle (like the lion problem). In this case, with v^D = v^H, the dog seems to
miss no matter where he starts.

Try some examples of your own and see if you can find interesting paths taken by the dog. 
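As one possible starting point for case (i), the coupled equations for X and Y can be integrated with a fixed-step fourth-order Runge-Kutta scheme. This is a sketch under stated assumptions rather than a prescribed method: the function names, the step size, the capture radius and the starting point (0, 1) are all hypothetical choices, and any standard ODE solver would serve equally well.

```python
import math

def simulate_pursuit(x0, y0, v_dog, hare_velocity, dt=1e-3, t_max=20.0,
                     capture_radius=1e-2):
    """Integrate dR/dt = -(R/|R|) v_dog - v_hare(t) with fixed-step RK4.

    (x0, y0) is the initial position of the dog relative to the hare.
    Returns (captured, time, closest_distance).
    """
    def rhs(t, X, Y):
        R = math.hypot(X, Y)
        vx, vy = hare_velocity(t)
        return (-v_dog * X / R - vx, -v_dog * Y / R - vy)

    t, X, Y = 0.0, x0, y0
    closest = math.hypot(X, Y)
    while t < t_max:
        k1 = rhs(t, X, Y)
        k2 = rhs(t + dt / 2, X + dt / 2 * k1[0], Y + dt / 2 * k1[1])
        k3 = rhs(t + dt / 2, X + dt / 2 * k2[0], Y + dt / 2 * k2[1])
        k4 = rhs(t + dt, X + dt * k3[0], Y + dt * k3[1])
        X += dt / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        Y += dt / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        t += dt
        closest = min(closest, math.hypot(X, Y))
        if closest <= capture_radius:   # treat "close enough" as capture
            return True, t, closest
    return False, t, closest

def hare_along_x(t):
    """Hare runs along the positive x-axis with unit speed (case (i))."""
    return (1.0, 0.0)

fast_dog = simulate_pursuit(0.0, 1.0, 1.5, hare_along_x)   # v^D > v^H
equal_dog = simulate_pursuit(0.0, 1.0, 1.0, hare_along_x)  # v^D = v^H
print(fast_dog)
print(equal_dog)
```

With these particular starting values the faster dog is reported as capturing the hare, while the dog of equal speed is not; the closest distance it records gives a feel for how it 'misses'. Replacing `hare_along_x` by a circular motion gives a way into case (ii).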

2.23 Consider further the piston problem described in Problem 2.16. Use computer assistance
to calculate the exact and approximate accelerations of the piston as functions of θ. Compare
the exact and approximate formulae (non-dimensionalised by ω^2 b) by plotting both on the
same graph against θ. Show that, when b/c < 0.5, the two graphs are close, but when b/c
gets close to unity, large errors occur.
Chapter Three


Newton’s laws of motion
and the law of gravitation


KEY FEATURES 

The key features of this chapter are Newton’s laws of motion, the definitions of mass and 
force, the law of gravitation, the principle of equivalence, and gravitation by spheres. 


This chapter is concerned with the foundations of dynamics and gravitation. Kinematics
is concerned purely with geometry of motion, but dynamics seeks to answer the
question as to what motion will actually occur when specified forces act on a body. The
rules that allow one to make this connection are Newton’s laws of motion. These are
laws of physics that are founded upon experimental evidence and stand or fall according
to the accuracy of their predictions. In fact, Newton’s formulation of mechanics has
been astonishingly successful in its accuracy and breadth of application, and has survived,
essentially intact, for more than three centuries. The same is true for Newton’s universal
law of gravitation which specifies the forces that all masses exert upon each other.

Taken together, these laws represent virtually the entire foundation of classical 
mechanics and provide an accurate explanation for a vast range of motions from large 
molecules to entire galaxies. 

3.1 NEWTON’S LAWS OF MOTION 

Isaac Newton’s* three famous laws of motion were laid down in Principia, written
in Latin and published in 1687. These laws set out the founding principles of mechanics
and have survived, essentially unchanged, to the present day. Even when translated into
English, Newton’s original words are hard to understand, mainly because the terminology
of the seventeenth century is now archaic. Also, the laws are now formulated as applying
to particles, a concept never used by Newton. A particle is an idealised body that occupies
only a single point of space and has no internal structure. True particles do not exist† in
nature, but it is convenient to regard realistic bodies as being made up of particles. Using
modern terminology, Newton’s laws may be stated as follows:


* Sir Isaac Newton (1643-1727) is arguably the greatest scientific genius of all time. His father was
completely uneducated and Isaac himself had no contact with advanced mathematics before the age of twenty.
However, by the age of twenty seven, he had been appointed to the Lucasian chair at Cambridge and was
one of the foremost scientists in Europe. His greatest achievements were his discovery of the calculus, his
laws of motion, and his theory of universal gravitation. On the urging of Halley (the Astronomer Royal),
Newton wrote up an account of his new physics and its application to astronomy. Philosophiae Naturalis
Principia Mathematica was published in 1687 and is generally recognised as the greatest scientific book
ever written.


Newton’s laws of motion 

First Law When all external influences on a particle are removed, the particle 
moves with constant velocity. [This velocity may be zero in which case the particle 
remains at rest.] 

Second Law When a force F acts on a particle of mass m, the particle moves with
instantaneous acceleration a given by the formula

F = ma, 

where the unit of force is implied by the units of mass and acceleration. 

Third Law When two particles exert forces upon each other, these forces are (i) 
equal in magnitude, (ii) opposite in direction, and (iii) parallel to the straight line 
joining the two particles. 


Units 

Any consistent system of units can be used. The standard scientific units are SI units in 
which the unit of mass is the kilogram, the unit of length is the metre, and the unit of 
time is the second. The unit of force implied by the Second Law is called the newton, 
and written N. An excellent description of the SI system of units can be found on 

http://www.physics.nist.gov/PhysRefData 

the website of the US National Institute of Standards & Technology. 

In the Imperial system of units, the unit of mass is the pound, the unit of length is the 
foot, and the unit of time is the second. The unit of force implied by the Second Law is 
called the poundal. These units are still used in some industries in the US, a fact which 
causes frequent confusion. 
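Since the unit of force is fixed by the units of mass and acceleration, the poundal can be expressed in newtons simply by converting one pound and one foot into SI units. The short sketch below does this; the conversion factors are the standard international definitions of the pound and the foot, and the function name is an illustrative choice.

```python
# Force units implied by the Second Law, F = m a, in two systems of units.
POUND_IN_KG = 0.45359237   # international avoirdupois pound (exact by definition)
FOOT_IN_M = 0.3048         # international foot (exact by definition)

def force(mass, acceleration):
    """Second Law: the force unit is (mass unit) x (acceleration unit)."""
    return mass * acceleration

one_newton = force(1.0, 1.0)                             # 1 kg x 1 m s^-2
one_poundal_in_newtons = force(POUND_IN_KG, FOOT_IN_M)   # 1 lb x 1 ft s^-2
print(one_poundal_in_newtons)  # roughly 0.138 N
```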

Interpreting Newton’s laws 

Newton’s laws are clear enough in themselves but they leave some important questions 
unanswered, namely: 

(i) In what frame of reference are the laws true?

(ii) What are the definitions of mass and force?


† The nearest thing to a particle is the electron, which, unlike other elementary particles, does seem to be a
point mass. The electron does however have an internal structure, having spin and angular momentum.

These questions are answered in the sections that follow. What we do is to set aside 
Newton’s laws for the time being and go back to simple experiments with particles. These 
are ‘thought experiments’ in the sense that, although they are perfectly meaningful, they
are unlikely to be performed in practice. The supposed ‘results’ of these experiments are 
taken to be the primitive governing laws of mechanics on which we base our definitions of 
mass and force. Finally, these laws and definitions are shown to be equivalent to Newton’s 
laws as stated above. This process could be said to provide an interpretation of Newton’s 
laws. The interpretation below is quite sophisticated and is probably only suitable for 
those who have already seen a simpler account, such as that given by French [3]. 

3.2 INERTIAL FRAMES AND THE LAW OF INERTIA 

The first law states that, when a particle is unaffected by external influences, it 
moves with constant velocity, that is, it moves in a straight line with constant speed. 
Thus, contrary to Aristotle’s view, the particle needs no agency of any kind to maintain 
its motion.* Since the influence of the Earth’s gravity rules out any verification of the 
First Law by an experiment conducted on Earth, Newton showed remarkable insight in 
proposing a law he could not possibly verify. In order to verify the First Law, all external 
influences must be removed, which means that we must carry out our thought experiment 
in a place as remote as possible from any material bodies, such as the almost empty space 
between the galaxies. In our minds then we go to such a place armed with a selection of 
test particles† which we release in various ways and observe their motion. According to
the First Law, each of these particles should move with constant velocity. 

Inertial reference frames 

So far we have ignored the awkward question as to what reference frame we should use to 
observe the motion of our test particles. When confronted with this question for the first 
time, one’s probable response is that the reference frame should be ‘fixed’. But fixed to 
what? The Earth rotates and is in orbital motion around the Sun. Our entire solar system 
is part of a galaxy that rotates about its centre. The galaxies themselves move relative 
to each other. The fact is that everything in the universe is moving relative to everything 
else and nothing can properly be described as fixed. From this it might be concluded that 
any reference frame is as good as any other, but this is not so, for, if the First Law is 
true at all, it can only be true in certain special reference frames. Suppose for instance
that the First Law has been found to be true in the reference frame T. Then it is also
true in any other frame T' that is mutually unaccelerated relative to T (see section 2.6).
This follows because, if the test particles have constant velocities in T, then they have
zero accelerations in T. But, since T and T' are mutually unaccelerated frames, the
test particles must have zero accelerations in T' and thus have constant velocities in T'.
Moreover, the First Law does not hold in any other reference frame.


* Such a law was proposed prior to Newton by Galileo but, curiously, Galileo did not accept the
consequences of his own statement.

† Since true particles do not exist, we will have to make do with uniform rigid spheres of various kinds.

Definition 3.1 Inertial frame A reference frame in which the First Law is true is said 
to be an inertial frame. 

It follows that, if there exists one inertial frame, then there exist infinitely many, with
each frame moving with constant velocity (and no rotation) relative to any other.

It may appear that the First Law is without physical content since we are saying that 
it is true in those reference frames in which it is true. However, this is not so since inertial 
frames need not have existed at all, and the fact that they do is the real physical content of 
the First Law. Why there should exist this special class of reference frames in which the 
laws of physics take simple forms is a very deep and interesting question that we do not 
have to answer here! 

Our discussion is summarised by the following statement which we take to be a law 
of physics: 

The law of inertia There exists in nature a unique class of mutually unaccelerated reference
frames (the inertial frames) in which the First Law is true.

Practical inertial frames 

The preceding discussion gives no clue as to how to set up an inertial reference frame and, 
in practice, exact inertial frames are not available. Practical reference frames have to be tied 
to real objects that are actually available. The most common practical reference frame is the
Earth. Such a frame is sufficiently close to being inertial for the purpose of observing most 
Earth-bound phenomena. The orbital acceleration of the Earth is insignificant and the effect of 
the Earth’s rotation is normally a small correction. For example, when considering the motion 
of a football, a pendulum or a spinning top, the Earth may be assumed to be an inertial frame. 

However, the Earth is not a suitable reference frame from which to observe the motion of 
an orbiting satellite. In this case, the geocentric frame (which has its origin at the centre of 
mass of the Earth and has no rotation relative to distant stars) would be appropriate. Similarly, 
the heliocentric frame (which has its origin at the centre of mass of the solar system and has 
no rotation relative to distant stars) is appropriate when observing the motion of the planets. 

Example 3.1 Inertial frames 

Suppose that a reference frame fixed to the Earth is exactly inertial. Which of the 
following are then inertial frames?

A frame fixed to a motor car which is 

(i) moving with constant speed around a flat race track, 

(ii) moving with constant speed along a straight undulating road, 

(iii) moving with constant speed up a constant gradient, 

(iv) freewheeling down a hill. 
Solution

Only (iii) is inertial. In the other cases, the frame is accelerating or rotating relative
to the Earth.

FIGURE 3.1 The particles P_1 and P_2 move under their mutual interaction and, relative to an
inertial reference frame T, have accelerations a_{12} and a_{21} respectively. These accelerations
are found to satisfy the law of mutual interaction.

3.3 THE LAW OF MUTUAL INTERACTION; MASS AND 
FORCE 

We first dispose of the question of what frame of reference should be used to 
observe the particle motions mentioned in the Second Law. The answer is that any inertial 
reference frame can be used and we will always assume this to be so, unless stated 
otherwise. As stated earlier, the problem in understanding the Second and Third Laws is 
that the concepts of mass and force are not defined, which is obviously unsatisfactory.

Our second thought experiment is concerned with the motion of a pair of mutually 
interacting particles. The nature of their mutual interaction can be of any kind* and all 
other influences are removed. Since each particle is influenced by the other, the First Law 
does not apply. The particles will, in general, have accelerations, these being independent 
of the inertial frame in which they are measured. Our second law of physics is concerned 
with the ‘observed’ values of these mutually induced accelerations. 

The law of mutual interaction Suppose that two particles P_1 and P_2 interact with each
other and that P_2 induces an instantaneous acceleration a_{12} in P_1, while P_1 induces an
instantaneous acceleration a_{21} in P_2. Then

(i) these accelerations are opposite in direction and parallel to the straight line joining
P_1 and P_2,

(ii) the ratio of the magnitudes of these accelerations, |a_{21}|/|a_{12}|, is a constant
independent of the nature of the mutual interaction between P_1 and P_2, and independent
of the positions and velocities† of P_1 and P_2.

Moreover, suppose that when P_2 interacts with a third particle P_3 the induced accelerations
are a_{23} and a_{32}, and when P_1 interacts with P_3 the induced accelerations are a_{13}
and a_{31}. Then the magnitudes of these accelerations satisfy the consistency relation‡

\frac{|\mathbf{a}_{21}|}{|\mathbf{a}_{12}|} \times \frac{|\mathbf{a}_{32}|}{|\mathbf{a}_{23}|} \times \frac{|\mathbf{a}_{13}|}{|\mathbf{a}_{31}|} = 1. (3.1)


* The mutual interaction might be, for example, (i) mutual gravitation, (ii) electrostatic interaction, caused
by the particles being electrically charged, or (iii) the particles being connected by a fine elastic cord.


Definition of inertial mass 

The law of mutual interaction leads us to our definitions of mass and force. The qualitative
definition of the (inertial) mass of a particle is that it is a numerical measure of the
reluctance of the particle to being accelerated. Thus, when particles P_1 and P_2 interact,
we attribute the fact that the induced accelerations a_{12} and a_{21} have different magnitudes
to the particles having different masses. This point of view is supported by the fact that the
ratio |a_{21}|/|a_{12}| depends only upon the particles themselves, and not on the interaction,
or where the particles are, or how they are moving. We define the mass ratio m_1/m_2 of
the particles P_1, P_2 to be the inverse ratio of the magnitudes of their mutually induced
accelerations, as follows:

Definition 3.2 Inertial mass The mass ratio m_1/m_2 of the particles P_1, P_2 is defined
to be

\frac{m_1}{m_2} = \frac{|\mathbf{a}_{21}|}{|\mathbf{a}_{12}|}. (3.2)


There is however a possible inconsistency in this definition of mass ratio. Suppose
that we introduce a third particle P_3. Then, by performing three experiments, we could
independently determine the three mass ratios m_1/m_2, m_2/m_3 and m_3/m_1 and there is no
guarantee that the product of these three ratios would be unity. However, the consistency
relation (3.1) assures us that it would be found to be unity, and this means that the above
definition defines the mass ratios of particles unambiguously.
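The role of the consistency relation can be illustrated with a small numerical sketch. The 'measured' acceleration magnitudes below are not experimental data: they are generated from assumed masses and assumed interaction strengths (all hypothetical values) purely so that the numbers obey the law of mutual interaction, and the function names are illustrative.

```python
import math

# Hidden 'true' masses and pairwise interaction strengths (assumed values).
assumed_masses = {1: 2.0, 2: 3.0, 3: 5.0}
pair_strength = {(1, 2): 6.0, (2, 3): 1.5, (1, 3): 10.0}

def induced_acceleration(i, j):
    """Magnitude |a_ij| of the acceleration induced in P_i by P_j."""
    strength = pair_strength[(min(i, j), max(i, j))]
    return strength / assumed_masses[i]

def mass_ratio(i, j):
    """Definition (3.2): m_i / m_j = |a_ji| / |a_ij|."""
    return induced_acceleration(j, i) / induced_acceleration(i, j)

# Consistency relation (3.1): the three independently measured ratios
# m_1/m_2, m_2/m_3 and m_3/m_1 must multiply to unity.
product = mass_ratio(1, 2) * mass_ratio(2, 3) * mass_ratio(3, 1)

# Taking P_1 as the reference mass gives every mass in 'P_1-units'.
masses_in_P1_units = {i: (1.0 if i == 1 else mass_ratio(i, 1)) for i in (1, 2, 3)}
print(product, masses_in_P1_units)
```

Note that the three pair strengths are quite different, yet the mass ratios come out independent of them, which is the content of part (ii) of the law.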

In order to have a numerical measure of mass, we simply choose some particle A as
the reference mass (having mass one unit), in which case the mass of any other particle
can be expressed as a number of ‘A-units’. If we were to use a different particle B as the
reference mass, we would obtain a second measure of mass in B-units, but this second
measure would just be proportional to the first, differing only by a multiplicative constant. In
SI units, the reference body (having mass one kilogram) is a cylinder of platinum-iridium
alloy kept under carefully controlled conditions in Paris.


† This is true when relativistic effects are negligible.

‡ The significance of the consistency relation will be explained shortly.
Example 3.2 A strange definition of mass

Suppose the mass ratio m_1/m_2 were defined in some other way, such as

\frac{m_1}{m_2} = \left( \frac{|\mathbf{a}_{21}|}{|\mathbf{a}_{12}|} \right)^{1/2}.

Is this just as good as the standard definition?

Solution

For some purposes it would be just as good. It would lead to the non-standard form

F = m^2 a

for the second law, and, for the motion of a single particle, the theory would be essentially
unaffected. We will see later however that, if mass were defined in this way, then
the mass of a multi-particle system would not be equal to the sum of the masses of its
constituent particles! This is not contradictory, but it is a very undesirable feature and
explains why the standard definition is used. ■
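The loss of additivity is easy to see numerically. In the sketch below the 'strange' mass is the square root of the standard mass, as implied by the definition in Example 3.2. The additivity of standard masses is taken on trust from the later result referred to in the solution, and the numerical values and names are assumptions made for illustration.

```python
import math

def strange_mass(standard_mass):
    """Mass under the alternative definition: m' = sqrt(m), so that F = m'^2 a."""
    return math.sqrt(standard_mass)

m1, m2 = 4.0, 9.0                       # standard masses of two particles (assumed)
composite_standard = m1 + m2            # standard masses add (result quoted from later)
composite_strange = strange_mass(composite_standard)
sum_of_strange = strange_mass(m1) + strange_mass(m2)
print(composite_strange, sum_of_strange)  # the two numbers differ
```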

Definition of force 

We now turn to the definition of force. Qualitatively, the presence of a force is the reason
we give for the acceleration of a particle. Thus, when interacting particles cause each
other to accelerate, we say it is because they exert forces upon each other. How do we
know that these forces are present? Because the particles are accelerating! These statements
are obviously circular and without real content. Force is therefore a quantity of
our own invention, but a very useful one nonetheless and an essential part of the Newtonian
formulation of classical mechanics. It should be noted though that the concept of
force is not an essential part of the Lagrangian or Hamiltonian formulations of classical
mechanics.*

In mutual interactions, the forces that the particles exert upon each other are defined 
as follows: 

Definition 3.3 Force Suppose that the particles P_1 and P_2 are in mutual interaction
and have accelerations a_{12} and a_{21} respectively. Then the force F_{12} that P_2 exerts on P_1,
and the force F_{21} that P_1 exerts on P_2 are defined to be

\mathbf{F}_{12} = m_1 \mathbf{a}_{12}, \qquad \mathbf{F}_{21} = m_2 \mathbf{a}_{21}, (3.3)

where the unit of force is implied by the units of mass and acceleration.

It follows that, in the case of two-particle interactions, the Second Law is true by
the definition of force. Also, since a_{12} and a_{21} are opposite in direction and are parallel
to the line P_1P_2, so then are F_{12} and F_{21}; thus parts (ii) and (iii) of the Third Law are
automatically true. Furthermore

|\mathbf{F}_{12}| = m_1 |\mathbf{a}_{12}| = m_2 |\mathbf{a}_{21}| = |\mathbf{F}_{21}|,

on using the definition (3.2) of the mass ratio m_1/m_2. Thus part (i) of the Third Law is
also true. Hence the law of mutual interaction, together with our definitions (3.2), (3.3) of
mass and force, implies the truth of the Second and Third Laws.


* This fact is important when making connections between classical mechanics and other theories, such as
general relativity or quantum mechanics. The concept of force does not appear in either of these theories.

FIGURE 3.2 The law of multiple interactions. In the presence of interactions
from the particles P_1, P_2, P_3, the acceleration a_0 of particle P_0 is given by
a_0 = a_{01} + a_{02} + a_{03}.
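The argument that the Third Law follows from the law of mutual interaction and the definitions (3.2), (3.3) can be traced through with numbers. In this sketch the masses and the acceleration a_{12} are arbitrary assumed values; a_{21} is then constructed to be opposite in direction with magnitude ratio |a_{21}|/|a_{12}| = m_1/m_2, as the law and the definition of mass ratio require.

```python
import math

m1, m2 = 2.0, 5.0               # masses of P_1 and P_2 (assumed values)
a12 = (3.0, -4.0, 0.0)          # acceleration induced in P_1 by P_2 (assumed)

# Law of mutual interaction with definition (3.2):
# a21 is opposite to a12 and |a21| / |a12| = m1 / m2.
a21 = tuple(-(m1 / m2) * component for component in a12)

# Definition (3.3) of the interaction forces.
F12 = tuple(m1 * component for component in a12)   # force that P_2 exerts on P_1
F21 = tuple(m2 * component for component in a21)   # force that P_1 exerts on P_2

def magnitude(vector):
    return math.sqrt(sum(component * component for component in vector))

print(F12, F21, magnitude(F12), magnitude(F21))
```

The printed forces are equal in magnitude and opposite in direction, and both lie along the direction of a_{12}, which by the law is the line joining the particles.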

3.4 THE LAW OF MULTIPLE INTERACTIONS 

Our third and final thought experiment is concerned with what happens when a 
particle is subject to more than one interaction. 

The law of multiple interactions Suppose the particles P_0, P_1, ..., P_N are interacting with
each other and that all other influences are removed. Then the acceleration a_0 induced in
P_0 can be expressed as

\mathbf{a}_0 = \mathbf{a}_{01} + \mathbf{a}_{02} + \cdots + \mathbf{a}_{0N}, (3.4)

where a_{01}, a_{02}, ... are the accelerations that P_0 would have if the particles P_1, P_2, ... were
individually interacting with P_0.

This result is sometimes expressed by saying that interaction forces act independently
of each other. It follows that

m_0 \mathbf{a}_0 = m_0 (\mathbf{a}_{01} + \mathbf{a}_{02} + \cdots + \mathbf{a}_{0N})
= \mathbf{F}_{01} + \mathbf{F}_{02} + \cdots + \mathbf{F}_{0N},

on using the definition (3.3) of mutual interaction forces. Thus the Second Law remains
true for multiple interactions provided that the ‘effective force’ F_0 acting on P_0 is understood
to mean the (vector) resultant of the individual interaction forces acting on P_0, that
is

\mathbf{F}_0 = \mathbf{F}_{01} + \mathbf{F}_{02} + \cdots + \mathbf{F}_{0N}.

This result is not always thought of as a law of physics, but it is.* It could have been
otherwise!


Experimental basis of Newton’s Laws 

1. We accept the law of inertia, the law of mutual interaction and the law of 
multiple interactions as the ‘experimental’ basis of mechanics. 

2. Together with our definitions of mass and force, these experimental laws imply 
that Newton’s laws are true in any inertial reference frame. 


3.5 CENTRE OF MASS 

We can now introduce the notion of the centre of mass of a collection of particles. 
Suppose we have a system of particles P_1, P_2, ..., P_N with masses m_1, m_2, ..., m_N, and
position vectors r_1, r_2, ..., r_N respectively. Then:

Definition 3.4 Centre of mass The centre of mass of this system of particles is the
point of space whose position vector R is defined by

\mathbf{R} = \frac{m_1 \mathbf{r}_1 + m_2 \mathbf{r}_2 + \cdots + m_N \mathbf{r}_N}{m_1 + m_2 + \cdots + m_N} = \frac{\sum_{i=1}^{N} m_i \mathbf{r}_i}{M}, (3.5)

where M is the sum of the separate masses.

The centre of mass of a system of particles is simply a ‘weighted’ mean of the position 
vectors of the particles, where the ‘weights’ are the particle masses. Centre of mass is an 
important concept in the mechanics of multi-particle systems. Unfortunately, there is a 
widespread belief that the centre of mass has a magical ability to describe the behaviour 
of the system in all circumstances. This is simply not true. For instance, we will show in 
the next section that it is not generally true that the total gravitational force that a system 
of masses exerts on a test mass is equal to the force that would be exerted by a particle of 
mass M situated at the centre of mass. 
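A simple numerical case already shows the discrepancy. The sketch below anticipates the inverse square law of gravitation stated in section 3.6 (force magnitude m_1 m_2 G / R^2) and applies it to bodies lying on a single axis. The configuration, the choice of units in which G = 1, and the function name are all assumptions made for illustration.

```python
import math

G = 1.0  # gravitational constant in illustrative units (assumption)

def force_on_test_mass(source_mass, source_x, test_mass, test_x):
    """x-component of the inverse-square attraction on a test mass.

    All bodies are taken to lie on the x-axis."""
    separation = source_x - test_x
    magnitude = G * source_mass * test_mass / separation ** 2
    return math.copysign(magnitude, separation)  # attraction points towards the source

sources = [(1.0, -1.0), (1.0, 1.0)]   # (mass, x) pairs; centre of mass at x = 0
test_mass, test_x = 1.0, 3.0

actual = sum(force_on_test_mass(m, x, test_mass, test_x) for m, x in sources)

total_mass = sum(m for m, _ in sources)
com_x = sum(m * x for m, x in sources) / total_mass
via_centre_of_mass = force_on_test_mass(total_mass, com_x, test_mass, test_x)

print(actual, via_centre_of_mass)  # the two forces are not equal
```

For this configuration the true attraction is noticeably stronger than that of a single particle of mass M at the centre of mass, because the nearer source contributes disproportionately under an inverse square law.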

Example 3.3 Finding centres of mass 

Find the centre of mass of (i) a pair of particles of different masses, (ii) three identical 
particles. 

Solution 

(i) For a pair of particles P\, Pi, the position vector of the centre of mass is given by 

ni\r\ + //f 2 T2 

A = -. 

ill 1 + ill 2 


* It certainly does not follow from the observation that ‘forces are vector quantities’! 
It follows that the centre of mass lies on the line $P_1P_2$ and divides this line in the ratio 
$m_2 : m_1$. 

(ii) For three identical particles $P_1$, $P_2$, $P_3$, the position vector of the centre of mass 
is given by 

$$\mathbf{R} = \frac{m\mathbf{r}_1 + m\mathbf{r}_2 + m\mathbf{r}_3}{m + m + m} = \frac{\mathbf{r}_1 + \mathbf{r}_2 + \mathbf{r}_3}{3}.$$

It follows that the centre of mass lies at the centroid of the triangle $P_1P_2P_3$. ■ 
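
The definition (3.5) is a mass-weighted average, which can be sketched in a few lines of Python. This is an illustrative sketch only: the function name `centre_of_mass` and the sample masses and positions are assumptions chosen for the demonstration, not part of the example above.

```python
def centre_of_mass(masses, positions):
    """Mass-weighted mean of the position vectors, as in definition (3.5)."""
    total = sum(masses)
    dim = len(positions[0])
    return tuple(
        sum(m * r[k] for m, r in zip(masses, positions)) / total
        for k in range(dim)
    )

# (i) Two particles with masses 1 and 3 placed at x = 0 and x = 4.
pair = centre_of_mass([1.0, 3.0], [(0.0, 0.0), (4.0, 0.0)])

# (ii) Three identical particles at the vertices of a triangle.
triangle = centre_of_mass([2.0, 2.0, 2.0], [(0.0, 0.0), (3.0, 0.0), (0.0, 6.0)])

print(pair, triangle)
```

In case (i) the result lies three units from the first particle and one unit from the second, so the line is divided in the ratio of the second mass to the first. In case (ii) the result is the plain average of the three vertices, that is, the centroid.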

The centres of mass of most of the systems we meet in mechanics are easily determined 
by symmetry considerations. However, when symmetry is lacking, the position of 
the centre of mass has to be worked out from first principles by using the definition (3.5), 
or its counterpart for continuous mass distributions. The Appendix at the end of the book 
contains more details and examples. 


3.6 THE LAW OF GRAVITATION 

Physicists recognise only four distinct kinds of interaction forces that exist in 
nature. These are gravitational forces, electromagnetic forces and weak/strong nuclear 
forces. The nuclear forces are important only within the atomic nucleus and will not 
concern us at all. The electromagnetic forces include electrostatic attraction and repulsion, 
but we will encounter them mainly as ‘forces of contact’ between material bodies. 
Since such forces are intermolecular, they are ultimately electromagnetic although 
we will make no use of this fact! The present section however is concerned with 
gravitation. 

It is an observed fact that any object with mass attracts any other object with mass 
with a force called gravitation. When gravitational interaction occurs between particles, 
the Third Law implies that the interaction forces must be equal in magnitude, opposite 
in direction and parallel to the straight line joining the particles. The magnitude of the 
gravitational interaction forces is given by: 


The law of gravitation 

The gravitational forces that two particles exert upon each other each have magnitude 

$$\frac{m_1 m_2 G}{R^2}, \tag{3.6}$$

where $m_1$, $m_2$ are the particle masses, $R$ is the distance between the particles, and 
$G$, the constant of gravitation, is a universal constant. Since $G$ is not dimensionless, 
its numerical value depends on the units of mass, length and force. 
This is the famous inverse square law of gravitation originally suggested by Robert 
Hooke,* a scientific contemporary (and adversary) of Newton. In SI units, the constant of 
gravitation is given approximately by 

$$G = 6.67 \times 10^{-11}\ \mathrm{N\,m^2\,kg^{-2}}, \tag{3.7}$$

this value being determined by observation and experiment. There is presently no theory 
(general relativity included) that is able to predict the value of G. Indeed, the theory of 
general relativity does not exclude repulsion between masses! 

To give some idea of the magnitudes of the forces involved, suppose we have two 
uniform spheres of lead, each with mass 5000 kg (five metric tons). Their common radius 
is about 47 cm which means that they can be placed with their centres 1 m apart. What 
gravitational force do they exert upon each other when they are in this position? We 
will show later that the gravitational force between uniform spheres of matter is exactly 
the same as if all the mass of each sphere were concentrated at its centre. Given that 
this result is true, we can find the force that each sphere exerts on the other simply by 
substituting $m_1 = m_2 = 5000$ and $R = 1$ into equation (3.6). This gives $F = 0.00167$ N 
approximately, the weight of a few grains of salt! Such forces seem insignificant, but 
gravitation is the force that keeps the Moon in orbit around the Earth, and the Earth in 
orbit around the Sun. The reason for this disparity is that the masses involved are so much 
larger than those of the lead spheres in our example. For instance, the mass of the Sun is 
about $2 \times 10^{30}$ kg. 
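
The arithmetic for the lead spheres is easy to reproduce. The following is a minimal sketch; the function name `gravitational_force` is an assumption introduced here, and the value of G is the approximate one quoted in (3.7).

```python
G = 6.67e-11  # constant of gravitation in N m^2 kg^-2, approximate value (3.7)

def gravitational_force(m1, m2, R):
    """Magnitude of the mutual attraction given by the law of gravitation (3.6)."""
    return m1 * m2 * G / R**2

# Two 5000 kg lead spheres with their centres 1 m apart.
F_lead = gravitational_force(5000.0, 5000.0, 1.0)
print(F_lead)  # in newtons
```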

3.7 GRAVITATION BY A DISTRIBUTION OF MASS 

It is important to be able to calculate the gravitational force exerted on a particle 
by a distribution of mass, such as a disc or sphere. The Earth, for example, is an approximately 
spherical mass distribution. We first treat an introductory problem of gravitational 
attraction by a pair of particles and then progress to continuous distributions of matter. 
In all cases, the law of multiple interactions means that the effective force exerted on a 
particle is the resultant of the individual forces of interaction exerted on that particle. 

Example 3.4 Attraction by a pair of particles 

A particle C, of mass m, and two particles A and B, each of mass M, are placed as 

shown in Figure 3.3. Find the gravitational force exerted on the particle C. 

Solution 

By the law of gravitation, each of the particles $A$ and $B$ attracts $C$ with a force of 
magnitude $F'$ where 

$$F' = \frac{mMG}{R^2},$$


* It was Newton however who proved that Kepler’s laws of planetary motion follow from the inverse square 
law. 
FIGURE 3.3 The particle C, of mass m, is 
attracted by the particles A and B, each of 
mass M . The resultant force on C points 
towards O. 



where $R = (a^2 + x^2)^{1/2}$ is the distance $AC$ ($= BC$). By symmetry, the resultant 
force F points in the direction $CO$ and so its magnitude $F$ can be found by summing 
the components of the contributing forces in this direction. Hence 

$$F = \frac{2mMG\cos\alpha}{R^2} = 2mMG\left(\frac{R\cos\alpha}{R^3}\right) = 2mMG\left(\frac{x}{(a^2 + x^2)^{3/2}}\right)$$

for $x > 0$. [The angle $\alpha$ is shown in Figure 3.3.] Thus the resultant force exerted on 
C looks nothing like the force exerted by a single gravitating particle. In particular, it 
is not equal to the force that would be exerted by a mass 2M placed at O. However, 
on writing F in the form 


$$F = \frac{2mMG}{x^2}\left(1 + \frac{a^2}{x^2}\right)^{-3/2},$$

we see that 

$$F \sim \frac{m(2M)G}{x^2}$$


when x/a is large. Thus, when C is very distant from A and B , the gravitational 
force exerted on C is approximately the same as that of a single particle of mass 2 M 
situated at O. 

The graph of the exact value of F as a function of x is shown in Figure 3.4. 
Dimensionless variables are used. $F = 0$ when $x = 0$, and rises to a maximum when 
$x = a/\sqrt{2}$ where $F = 4mMG/(3\sqrt{3}\,a^2)$. Thereafter, $F$ decreases, becoming ever 
closer to its asymptotic form $m(2M)G/x^2$. ■ 
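
The behaviour described in this example can be checked numerically. The sketch below is illustrative only: it works in units where the product mMG and the length a are both 1 (an assumption made for convenience), and the function names are introduced here.

```python
import math

def pair_force(x, a=1.0, mMG=1.0):
    """Exact resultant force on C: 2 mMG x / (a^2 + x^2)^(3/2)."""
    return 2.0 * mMG * x / (a * a + x * x) ** 1.5

def far_field(x, mMG=1.0):
    """Force of a single particle of mass 2M at O: m(2M)G / x^2."""
    return 2.0 * mMG / x**2

a = 1.0
x_peak = a / math.sqrt(2.0)          # location of the maximum
F_peak = pair_force(x_peak, a)       # should equal 4 / (3 sqrt(3)) in these units
ratio_far = pair_force(100.0, a) / far_field(100.0)

print(F_peak, ratio_far)
```

Evaluating `pair_force` slightly to either side of `x_peak` gives smaller values, and `ratio_far` is close to 1, in line with the asymptotic form.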

General asymptotic form of F as $r \to \infty$ 

The asymptotic result in the last example is true for attraction by any bounded* distribution 
of mass. The general result can be stated as follows: 


* This excludes mass distributions that extend to infinity, such as an infinite straight wire. 
FIGURE 3.4 The dimensionless resultant force 
$(mMG/a^2)^{-1}F$ plotted against $x/a$. 


Let $S$ be any bounded system of masses with total mass $M$. Then the force F exerted 
by $S$ on a particle $P$, of mass $m$ and position vector* r, has the asymptotic form 

$$\mathbf{F} \sim -\frac{mMG}{r^2}\,\hat{\mathbf{r}},$$

as $r \to \infty$, where $r = |\mathbf{r}|$ and $\hat{\mathbf{r}} = \mathbf{r}/r$. 

In other words, the force exerted by S on a distant particle is approximately the same 
as that exerted by a particle of mass equal to the total mass of S, situated at the centre of 
mass of S. 


Example 3.5 Gravitation by a uniform rod 

A particle $P$, of mass $m$, and a uniform rod, of length $2a$ and mass $M$, are placed as 
shown in Figure 3.5. Find the gravitational force that the rod exerts on the particle. 

Solution 

Consider the element $[x, x + dx]$ of the rod which has mass $M\,dx/2a$ and exerts 
an attractive force of magnitude 

$$\frac{m(M\,dx/2a)G}{R^2}$$

on P , where R is the distance shown in Figure 3.5. By symmetry, the resultant force 
acts towards the centre O of the rod and can be found by summing the components of 
the contributing forces in the direction P O. Since the rod is a continuous distribution 


* The result, as stated, is true for any choice of the origin $O$ of position vectors. However, the asymptotic 
error is least if $O$ is located at the centre of mass of $S$. In this case the relative error is of order $(a/r)^2$, 
where $a$ is the maximum ‘radius’ of the mass distribution about $O$. 
FIGURE 3.5 A particle $P$, of mass $m$, is 
attracted by a uniform rod of length $2a$ and 
mass $M$. The resultant force F on $P$ points 
towards the centre $O$ of the rod. 

of mass, this sum becomes an integral. The resultant force exerted by the rod thus has 
magnitude $F$ given by 

$$F = \frac{mMG}{2a}\int_{-a}^{a}\frac{\cos\alpha}{R^2}\,dx = \frac{mMG}{2a}\int_{-a}^{a}\frac{R\cos\alpha}{R^3}\,dx = \frac{mMG}{2a}\int_{-a}^{a}\frac{b\,dx}{(x^2 + b^2)^{3/2}},$$

where $b$ is the distance of $P$ from the centre of the rod. This integral can be evaluated 
by making the substitution $x = b\tan\theta$, the limits on $\theta$ being $\theta = \pm\beta$, where $\tan\beta = a/b$. 
This gives 

$$F = \frac{mMG}{2a}\left(\frac{2\sin\beta}{b}\right) = \frac{mMG}{2a}\left(\frac{2a}{b(b^2 + a^2)^{1/2}}\right) = \frac{mMG}{b(b^2 + a^2)^{1/2}}.$$
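
As a check on the integration, the sum over rod elements can be carried out numerically and compared with the closed form. This is a sketch under stated assumptions: the midpoint rule is used as a simple stand-in for the exact integral, units are chosen so that mMG = 1, and the function names and sample values of a and b are illustrative.

```python
import math

def rod_force_numeric(a, b, n=20000, mMG=1.0):
    """Midpoint-rule sum of the elemental contributions b dx / (x^2 + b^2)^(3/2)."""
    h = 2.0 * a / n
    total = 0.0
    for k in range(n):
        x = -a + (k + 0.5) * h          # midpoint of the k-th element of the rod
        total += b / (x * x + b * b) ** 1.5 * h
    return mMG / (2.0 * a) * total

def rod_force_exact(a, b, mMG=1.0):
    """Closed form mMG / (b (b^2 + a^2)^(1/2)) obtained in the example."""
    return mMG / (b * math.sqrt(a * a + b * b))

F_num = rod_force_numeric(1.0, 0.5)
F_exact = rod_force_exact(1.0, 0.5)
print(F_num, F_exact)
```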


Example 3.6 Gravitation by a uniform disk 

A particle $P$, of mass $m$, is situated on the axis of a uniform disk, of mass $M$ and 
radius a , as shown in Figure 3.6. Find the gravitational force that the disk exerts on 
the particle. 

Solution 

Consider the element of area $dA$ of the disk which has mass $M\,dA/\pi a^2$ and 
attracts $P$ with a force of magnitude 

$$\frac{m(M\,dA/\pi a^2)G}{R^2},$$

where R is the distance shown in Figure 3.6. By symmetry, the resultant force acts 
towards the centre O of the disk and can be found by summing the components of the 
FIGURE 3.6 Left: A particle $P$, of mass $m$, is attracted by a uniform disk of mass $M$ and 
radius $a$. The resultant force F on $P$ points towards the centre $O$ of the disk. Right: The 
element of area $dA$ in polar coordinates $r$, $\theta$. 


contributing forces in the direction P O. The resultant force exerted by the disk thus 
has magnitude F given by 


$$F = \frac{mMG}{\pi a^2}\int_A \frac{\cos\alpha}{R^2}\,dA,$$


where the integral is to be taken over the region $A$ occupied by the disk. This integral 
is most easily evaluated using polar coordinates. In this case $dA = (dr)(r\,d\theta) = 
r\,dr\,d\theta$, and the integrand becomes 

$$\frac{\cos\alpha}{R^2} = \frac{R\cos\alpha}{R^3} = \frac{b}{(r^2 + b^2)^{3/2}},$$


where $b$ is the distance of $P$ from the centre of the disk. The ranges of integration for 
$r$, $\theta$ are $0 \le r \le a$ and $0 \le \theta \le 2\pi$. We thus obtain 

$$F = \frac{mMG}{\pi a^2}\int_{r=0}^{r=a}\int_{\theta=0}^{\theta=2\pi}\left(\frac{b}{(r^2 + b^2)^{3/2}}\right) r\,dr\,d\theta.$$


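Since the integrand is independent of $\theta$, the $\theta$-integration is trivial leaving 

$$F = \frac{mMG}{\pi a^2}\int_{r=0}^{r=a}\frac{2\pi b\,r\,dr}{(r^2 + b^2)^{3/2}} = \frac{2mMG}{a^2}\Big[-b\,(r^2 + b^2)^{-1/2}\Big]_{r=0}^{r=a} = \frac{2mMG}{a^2}\left[1 - \frac{b}{(a^2 + b^2)^{1/2}}\right].$$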
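
The disk result can be checked in the same way as the rod. The sketch below assumes units with mMG = 1, uses a midpoint rule for the radial integral, and introduces illustrative function names; it also confirms that far from the disk the force approaches that of a particle of mass M at the centre.

```python
import math

def disk_force_numeric(a, b, n=20000, mMG=1.0):
    """Midpoint-rule value of (mMG / pi a^2) * integral of 2 pi b r dr / (r^2 + b^2)^(3/2)."""
    h = a / n
    total = 0.0
    for k in range(n):
        r = (k + 0.5) * h               # midpoint of the k-th ring
        total += 2.0 * math.pi * b * r / (r * r + b * b) ** 1.5 * h
    return mMG / (math.pi * a * a) * total

def disk_force_exact(a, b, mMG=1.0):
    """Closed form (2 mMG / a^2) * (1 - b / (a^2 + b^2)^(1/2))."""
    return 2.0 * mMG / (a * a) * (1.0 - b / math.sqrt(a * a + b * b))

F_disk_num = disk_force_numeric(1.0, 0.5)
F_disk_exact = disk_force_exact(1.0, 0.5)
far_ratio = disk_force_exact(1.0, 100.0) / (1.0 / 100.0**2)   # compare with mMG / b^2
print(F_disk_num, F_disk_exact, far_ratio)
```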


Gravitation by spheres 

Because of its applications to astronomy and space travel, and because we live on a nearly 
spherical body, gravitation by a spherical mass distribution is easily the most important 
case. We suppose that the mass distribution occupies a spherical volume and is also spherically 
symmetric so that the mass density depends only on distance from the centre of the 
FIGURE 3.7 Left: A particle $P$, of mass $m$, is attracted by a symmetric sphere of radius $a$ and total 
mass $M$. Right: The element of volume $dv$ in spherical polar coordinates $r$, $\theta$, $\phi$. 


sphere. We call such a body a symmetric sphere. The fact that we do not require the 
density to be uniform is very important in practical applications. The Earth, for instance, 
has a density of about 3,000 kg m$^{-3}$ near its surface, but its density at the centre is about 
16,500 kg m$^{-3}$. Similar remarks apply to the Sun. Thus, if our results were restricted to 
spheres of uniform density, they would not apply to the Earth or the Sun, the two most 
important cases. 

The fundamental result concerning gravitation by a symmetric sphere was proved by 
Newton himself and confirmed his universal theory of gravitation. It is presented here as 
a theorem. 

Theorem 3.1 The gravitational force exerted by a symmetric sphere of mass M on a 
particle external to itself is exactly the same as if the sphere were replaced by a particle 
of mass M located at the centre. 

Proof. Figure 3.7 shows a symmetric sphere with centre O and radius a, and a particle P , 
of mass m, exterior to the sphere. We wish to calculate the force exerted by the sphere on the 
particle. The calculation is similar to that in the ‘disk’ example, but this time the integration 
must be carried out over the spherical volume occupied by the mass distribution. 

Consider the element of volume $dv$ of the sphere which has mass $\rho\,dv$ and attracts $P$ with 
a force of magnitude 

$$\frac{m(\rho\,dv)G}{R^2},$$


where R is the distance shown in Figure 3.7. By symmetry, the resultant force acts towards 
the centre O of the sphere and can be found by summing the components of the contributing 
forces in the direction PO. The resultant force exerted by the sphere thus has magnitude F 
given by 

$$F = mG\int_V \frac{\rho\cos\alpha}{R^2}\,dv,$$

where the integral is to be taken over the region $V$ occupied by the sphere. This integral 
is most easily evaluated using spherical polar coordinates $r$, $\theta$, $\phi$. In this case $dv = 
(dr)(r\,d\theta)(r\sin\theta\,d\phi) = r^2\sin\theta\,dr\,d\theta\,d\phi$, and the integrand becomes 

$$\frac{\rho\cos\alpha}{R^2} = \frac{\rho R\cos\alpha}{R^3} = \frac{\rho(r)\,(b - r\cos\theta)}{(r^2 + b^2 - 2rb\cos\theta)^{3/2}},$$

on using the cosine rule $R^2 = r^2 + b^2 - 2rb\cos\theta$, where $b$ is the distance of $P$ from the 
centre of the sphere. The ranges of integration for $r$, $\theta$, $\phi$ are $0 \le r \le a$, $0 \le \theta \le \pi$ and 
$0 \le \phi \le 2\pi$. We thus obtain 


$$F = mG\int_{r=0}^{r=a}\int_{\theta=0}^{\theta=\pi}\int_{\phi=0}^{\phi=2\pi}\left(\frac{\rho(r)\,(b - r\cos\theta)}{(r^2 + b^2 - 2rb\cos\theta)^{3/2}}\right) r^2\sin\theta\,dr\,d\theta\,d\phi.$$


This time the $\phi$-integration is trivial leaving 

$$F = mG\int_{r=0}^{r=a}\int_{\theta=0}^{\theta=\pi}\frac{2\pi\rho(r)\,(b - r\cos\theta)}{(r^2 + b^2 - 2rb\cos\theta)^{3/2}}\,r^2\sin\theta\,dr\,d\theta = 2\pi mG\int_{r=0}^{r=a} r^2\rho(r)\left\{\int_{\theta=0}^{\theta=\pi}\frac{(b - r\cos\theta)\sin\theta\,d\theta}{(r^2 + b^2 - 2rb\cos\theta)^{3/2}}\right\}dr,$$

on taking the $\theta$-integration first and the $r$-integration second. 

The $\theta$-integration is tricky if done directly, but it comes out nicely on making the change 
of variable from $\theta$ to $R$ given by 

$$R^2 = r^2 + b^2 - 2rb\cos\theta, \qquad (R > 0).$$


(In this change of variable, $r$ has the status of a constant.) The range of integration for $R$ is 
$b - r \le R \le b + r$. Then 

$$2R\,dR = 2rb\sin\theta\,d\theta, \qquad b - r\cos\theta = \frac{2b^2 - 2rb\cos\theta}{2b} = \frac{R^2 + (b^2 - r^2)}{2b},$$

and the $\theta$-integral becomes 

$$\int_{b-r}^{b+r}\left(\frac{R^2 + (b^2 - r^2)}{2bR^3}\right)\frac{R\,dR}{rb} = \frac{1}{2rb^2}\int_{b-r}^{b+r}\left(1 + \frac{b^2 - r^2}{R^2}\right)dR = \frac{2}{b^2},$$

on performing the now elementary integration. 
Hence 

$$F = \frac{mG}{b^2}\left\{4\pi\int_0^a r^2\rho(r)\,dr\right\},$$


and this is as far as we can go without knowing the density function $\rho(r)$. The answer that we 
are looking for is that $F = mMG/b^2$, where $M$ is the total mass of the sphere. Now $M$ can 
also be calculated as a volume integral. Since the mass of the volume element $dv$ is $\rho\,dv$, the 
total mass $M$ is given by 

$$M = \int_V \rho\,dv = \int_{r=0}^{r=a}\int_{\theta=0}^{\theta=\pi}\int_{\phi=0}^{\phi=2\pi}\rho(r)\,r^2\sin\theta\,dr\,d\theta\,d\phi = 4\pi\int_0^a r^2\rho(r)\,dr,$$

on performing the $\theta$- and $\phi$-integrations. 
Hence, we finally obtain 

$$F = \frac{mMG}{b^2},$$

which is the required result. 

Since there is no reason why the density p(r) should not be zero over part of its range, this 
result also applies to the case of a particle external to a hollow sphere. The case of a particle 
inside a hollow sphere is different (see Problem 3.5). ■ 
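
The theorem can be tested numerically for a density that is not uniform. The sketch below is illustrative only: the density function `rho`, the values of a and b, and the use of midpoint-rule quadrature are all assumptions made for the demonstration, and units are chosen so that mG = 1.

```python
import math

def theta_integral(r, b, n=2000):
    """Midpoint-rule estimate of the theta-integral in the proof (expected value 2 / b^2)."""
    h = math.pi / n
    total = 0.0
    for k in range(n):
        th = (k + 0.5) * h
        R2 = r * r + b * b - 2.0 * r * b * math.cos(th)   # cosine rule
        total += (b - r * math.cos(th)) * math.sin(th) / R2 ** 1.5 * h
    return total

def sphere_force(rho, a, b, n=400, mG=1.0):
    """F = 2 pi mG * integral over r of r^2 rho(r) times the theta-integral."""
    h = a / n
    total = 0.0
    for k in range(n):
        r = (k + 0.5) * h
        total += r * r * rho(r) * theta_integral(r, b) * h
    return 2.0 * math.pi * mG * total

def sphere_mass(rho, a, n=400):
    """M = 4 pi * integral over r of r^2 rho(r), using the same radial grid."""
    h = a / n
    return 4.0 * math.pi * sum(((k + 0.5) * h) ** 2 * rho((k + 0.5) * h) * h for k in range(n))

def rho(r):
    """Hypothetical density, denser at the centre than at the surface."""
    return 2.0 - r

a, b = 1.0, 1.5                      # sphere radius and distance of the external particle
F_sphere = sphere_force(rho, a, b)
M_sphere = sphere_mass(rho, a)
print(F_sphere, M_sphere / b**2)     # the two values should agree closely
```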

Spheres attracted by other spheres 

Since any element of mass is attracted by a symmetric sphere as if the sphere were a particle, 
it follows that the force that a symmetric sphere exerts on any other mass distribution 
can be calculated by replacing the sphere by a particle of equal mass located at its centre. 
In particular then, the force that two symmetric spheres exert upon each other is the same 
as if each sphere were replaced by its equivalent particle. Thus, as far as the forces of 
gravitational attraction are concerned, symmetric spheres behave exactly as if they were 
particles. 


3.8 THE PRINCIPLE OF EQUIVALENCE AND g 


Although we have so far not mentioned it, the law of gravitation hides a deep and 
very surprising fact, namely, that the force between gravitating particles is proportional 
to each of their inertial masses. Now inertial mass, as defined by equation (3.2), has no 
necessary connection with gravitation. It is a measure of the reluctance of that particle 
to being accelerated and can be determined by non-gravitational means, for instance, by 
using electrostatic interactions between the particles. It is a matter of extreme surprise 
then that a quantity that seems to have no necessary connection with gravitation actually 
determines the force of gravitation between particles. What we would have expected was 
that each particle would have a second property $m^*$, called gravitational mass (not the 
same as $m$), which appears in the law of gravitation (3.6) and determines the gravitational 
force. For example, suppose that we have three uniform spheres of gold, silver and bronze 
and that the silver and bronze spheres have equal inertial mass. Then the law of gravitation 
states that, when separated by equal distances, the gold sphere will attract the silver and 
bronze spheres with equal forces, whereas we would have expected these forces to be 
different. 
FIGURE 3.8 A particle of mass $M$ is attracted 
by the gravitation of the system $S$ which 
consists of $N$ particles with masses 
$\{m_i\}$ ($1 \le i \le N$). 


The question arises then as to whether m and m* are actually equal or just nearly 
equal so that the difference is difficult to detect. Newton himself did experiments with 
pendulums made of differing materials, but could not detect any difference in the period. 
Newton’s experiment could have detected a difference of about one part in $10^3$. However, 
the classic experiment of Eötvös (1890) and its later refinements have now shown that any 
difference between $m$ and $m^*$ is less than one part in $10^{11}$. This leads us to believe that $m$ 
and $m^*$ really are equal and that the law of gravitation means exactly what it says. 

The proposition that inertial and gravitational mass are exactly equal is called the 
principle of equivalence. Although we accept the principle of equivalence as being true, 
we still have no explanation why this is so! In this context, it is worth remarking that 
Einstein made the principle of equivalence into a fundamental assumption of the theory 
of general relativity. 

The gravitational acceleration g 

Suppose a particle P of mass M is under the gravitational attraction of the system S, as 
shown in Figure 3.8. Then, by the law of gravitation, the resultant force F that S exerts 
upon P is given by 


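$$\mathbf{F} = \frac{Mm_1G}{r_1^2}\,\hat{\mathbf{e}}_1 + \frac{Mm_2G}{r_2^2}\,\hat{\mathbf{e}}_2 + \cdots + \frac{Mm_NG}{r_N^2}\,\hat{\mathbf{e}}_N = M\sum_{i=1}^{N}\frac{m_iG}{r_i^2}\,\hat{\mathbf{e}}_i = M\mathbf{g},$$
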


where the vector g, defined by 

$$\mathbf{g} = \sum_{i=1}^{N}\frac{m_iG}{r_i^2}\,\hat{\mathbf{e}}_i,$$




is independent of $M$. Then, by the Second Law, the induced acceleration a of particle $P$ 
is determined by the equation 

$$M\mathbf{g} = M\mathbf{a},$$

that is, 

$$\mathbf{a} = \mathbf{g}.$$
Thus the induced acceleration g is the same for any particle situated at that point. This 
rather remarkable fact is a direct consequence of the principle of equivalence. Tradition 
has it that, prior to Newton, Galileo did experiments in which he released different masses 
from the top of the Tower of Pisa and found that they reached the ground at the same 
time. Galileo’s result is thus a colourful but rather inaccurate verification of the principle 
of equivalence! 
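
The independence of the induced acceleration from the mass M can be illustrated with a short computation. This is a sketch under stated assumptions: the system S below is a hypothetical pair of point masses, the function names are introduced here, and the unit vector for each source is taken to point from the particle towards that source.

```python
import math

G = 6.67e-11  # constant of gravitation in N m^2 kg^-2, approximate value (3.7)

def g_vector(point, sources):
    """Sum of m_i G / r_i^2 times the unit vector from `point` towards each source."""
    g = [0.0, 0.0, 0.0]
    for m_i, pos in sources:
        d = [pos[k] - point[k] for k in range(3)]
        r = math.sqrt(sum(c * c for c in d))
        for k in range(3):
            g[k] += m_i * G * d[k] / r**3     # (m_i G / r^2) * (d / r)
    return tuple(g)

def induced_acceleration(M, point, sources):
    """Force M g divided by the inertial mass M, as given by the Second Law."""
    force = [M * c for c in g_vector(point, sources)]
    return tuple(f / M for f in force)

# Hypothetical system S: (mass in kg, position in m).
sources = [(3.0e10, (0.0, 0.0, 2.0)), (1.0e10, (1.0, 0.0, 0.0))]
origin = (0.0, 0.0, 0.0)

light = induced_acceleration(0.001, origin, sources)   # a one-gram particle
heavy = induced_acceleration(1000.0, origin, sources)  # a one-tonne particle
print(light, heavy)
```

Up to rounding, both particles receive the same acceleration, because the mass M cancels between the force and the Second Law.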

Gravitation by the Earth (rotation neglected) 

In the present treatment, the rotation of the Earth is neglected and we regard the Earth as 
an inertial frame of reference. A more accurate treatment which takes the Earth’s rotation 
into account is given in Chapter 17. 

When the system S is the Earth (or some other celestial body) it is convenient to 
introduce the notion of the local vertical direction. The unit vector k , which has the 
opposite direction to g, is called the vertically upwards unit vector relative to the Earth. 
In terms of k , the force exerted by the Earth on a particle of mass M is given by 

F = -Mgk, 

where the gravitational acceleration g is the magnitude of the gravitational acceleration 
vector g. Both g and k are functions of position on the Earth. 

Weight 

The positive quantity Mg (which is a function of position) is called the weight of the 
particle P. It is the magnitude of the gravity force exerted on P by the Earth. Thus the 
same body will have different weights depending upon where it is situated. However, at 
a fixed point of space, the weights of bodies are proportional to their masses. This fact, 
which is a consequence of the principle of equivalence, enables masses to be compared 
simply by comparing their weights at the same location (by using a balance, for instance). 

The approximation of uniform gravity 

It is easy to see that the Earth’s gravitational acceleration g and the vertical direction k 
depend upon position. The Earth is approximately a symmetric sphere which exerts its 
gravitational force as if all its mass were at its centre. Thus, if the value of g at a point on 
the Earth’s surface is gi, then the value of g at a height of 6,400 km (the Earth’s radius) 
must be gi/4 approximately. On the other hand, the vertical vector k changes from point 
to point on the Earth’s surface. These changes will be significant for motions whose 
extent is significant compared to the Earth’s radius; this is true for a ballistic missile, for 
instance. However, most motions taking place on Earth have an extent that is insignificant 
compared to the Earth’s radius and for which the variations of g and k are negligible. 
Simple examples include the motion of a tennis ball, a javelin or a bullet. 

The approximation in which g and k are assumed to be constants is called uniform 
gravity. Uniform gravity is the most common force field in mechanics. Many of the 
problems solved in this book make this simplifying (and accurate) approximation. 
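
The size of the error made by assuming uniform gravity can be estimated from the inverse square law. The sketch below treats the Earth as a symmetric sphere of radius 6,400 km, as in the text; the function name `g_ratio` and the sample heights are illustrative assumptions.

```python
R_EARTH = 6.4e6  # Earth's radius in metres, the rounded value used in the text

def g_ratio(h, R=R_EARTH):
    """Ratio g(h) / g(0) for a symmetric sphere: gravity falls off as 1 / (distance from centre)^2."""
    return (R / (R + h)) ** 2

ratio_one_radius = g_ratio(6.4e6)   # one Earth radius above the surface
ratio_javelin = g_ratio(30.0)       # a hypothetical 30 m rise, typical of a thrown object
print(ratio_one_radius, 1.0 - ratio_javelin)
```

At a height of one Earth radius the ratio is one quarter, while over a rise of a few tens of metres the fractional change in g is of order one part in 100,000.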
FIGURE 3.9 An elevator contains a ball B 
and both are freely falling under uniform 
gravity. T is an inertial reference frame and 
T' is a reference frame attached to the falling 
elevator. 



Numerical values of g 

The value of g at any location on the Earth can be measured experimentally (by using a 
pendulum for instance). The value of g is not quite constant over the Earth’s surface since 
the Earth does not quite have spherical symmetry and different locations have differing 
altitudes. At sea level on Earth, $g = 9.8$ m s$^{-2}$ approximately, and a rough value of 
10 m s$^{-2}$ is often assumed. The corresponding value for the Moon is 1.6 m s$^{-2}$, roughly 
a sixth of the Earth’s value. 


Example 3.7 Particle inside a falling elevator 

An elevator cable has snapped and the elevator and its contents are falling under 
uniform gravity. One of the passengers takes a ball from his pocket and throws it to 
another passenger.* What is the motion of the ball relative to the elevator? 

Solution 

Suppose that the ball has mass $m$ and that the local (vector) gravitational acceleration 
is $-g\mathbf{k}$. Then the motion of the ball relative to an inertial reference frame T (fixed to 
the ground, say) is determined by the Second Law, namely, 

ma = —mgk, 

where a is the acceleration of the ball measured in T. 

Let the frame T' be attached to the elevator, as shown in Figure 3.9. Then the 
acceleration $\mathbf{a}'$ of the ball measured in T' is given (see section 2.6) by 

$$\mathbf{a} = \mathbf{a}' + \mathbf{A},$$

where A is the acceleration of the frame T' relative to T. But the elevator, to which 
the frame T' is attached, is also moving under uniform gravity and its acceleration A 
is therefore, by the principle of equivalence, the same as that of the ball, namely, 

A = -gk. 
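
The relations above can be explored numerically by following the ball and the elevator in the inertial frame and subtracting. This is a sketch under stated assumptions: both bodies are taken to be in free fall under the same uniform gravity, motion is restricted to a vertical plane with coordinates (x, z), and the initial positions and velocities are hypothetical.

```python
G_ACC = 9.8  # uniform gravitational acceleration in m s^-2 (assumed value)

def position(t, r0, v0):
    """(x, z) at time t in the inertial frame for a body in free fall under uniform gravity."""
    x = r0[0] + v0[0] * t
    z = r0[1] + v0[1] * t - 0.5 * G_ACC * t * t
    return (x, z)

def relative_position(t, ball, elevator):
    """Position of the ball relative to the elevator; each argument is (r0, v0)."""
    bx, bz = position(t, *ball)
    ex, ez = position(t, *elevator)
    return (bx - ex, bz - ez)

ball = ((0.0, 1.5), (2.0, 0.5))        # hypothetical throw from 1.5 m above the floor
elevator = ((0.0, 0.0), (0.0, -3.0))   # elevator already falling at 3 m/s

samples = [relative_position(t, ball, elevator) for t in (0.0, 0.5, 1.0)]
print(samples)
```

The relative positions at equally spaced times change by equal steps, so the ball moves in a straight line at constant velocity relative to the elevator, which is what the cancellation of the two accelerations implies.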


* People do react oddly when put under pressure. 
Chapter Four 


Problems in particle dynamics 


KEY FEATURES 

The key features in this chapter are (i) the vector equation of motion and its reduction to 
scalar equations, (ii) motion in a force field, (iii) geometrical constraints and forces of constraint, 
and (iv) linear and quadratic resistance forces. 


Particle dynamics is concerned with the problem of calculating the motion of a particle 
that is acted upon by specified forces. Our starting point is Newton’s laws. However, since 
the First Law merely tells us that we should observe the motion from an inertial frame, 
and the Third Law will never be used (since there is only one particle), the entirety of 
particle dynamics is based on the Second Law 


$$m\mathbf{a} = \mathbf{F}_1 + \mathbf{F}_2 + \cdots + \mathbf{F}_N,$$

where $\mathbf{F}_1, \mathbf{F}_2, \ldots, \mathbf{F}_N$ are the various forces that are acting on the particle. The typical 
method of solution is to write the Second Law in the form 

$$m\frac{d\mathbf{v}}{dt} = \mathbf{F}_1 + \mathbf{F}_2 + \cdots + \mathbf{F}_N, \tag{4.1}$$

which is a first order ODE for the unknown velocity function $\mathbf{v}(t)$ and is called the equation 
of motion of the particle. If the initial value of v is given, then equation (4.1) can 
often be solved to yield v as a function of the time t. Once v is determined (and if the 
initial position of the particle is given), the position vector r of the particle at time t can 
be found by solving the first order ODE dr/dt = v. The sections that follow contain 
many examples of the implementation of this method. Indeed, it is remarkable how many 
interesting problems can be solved in this way. 


Question When can real bodies be modelled as particles? 

Newton’s laws apply to particles, but real bodies are not particles. When can real 
bodies, such as a tennis ball, a spacecraft, or the Earth, be treated as if they were 
particles? 
FIGURE 4.1 The particle is initially at the 
origin and is projected vertically upwards 
with speed $u$. The particle moves in a vertical 
straight line (the axis $Oz$) under the uniform 
gravity force $mg$ and possibly a resistance (or 
drag) force $D$. At time $t$ the particle has 
upward velocity $v$. [Figure panels: Initially; After time $t$.] 


Answer 

This is quite a tricky question which is not fully discussed until Chapter 10. What we 
will show is that the centre of mass of any body moves as if it were a particle of mass 
equal to the total mass, and all the forces on the body acted upon it. In particular, 
a rigid body, moving without rotation, can be treated exactly as if it were a particle. 
For example, a block sliding without rotation on a table can be treated exactly as if it 
were a particle. In other cases we gain only partial information about the motion. If 
the body is a brick thrown through the air, then particle dynamics can tell us exactly 
where its centre of mass will go, but not which point of the brick will hit the ground 
first. ■ 


4.1 RECTILINEAR MOTION IN A FORCE FIELD 

Our first group of problems is concerned with the straight line motion of a particle 
moving in a force field. A force F is said to be a field if it depends only on the position of 
the particle, and not, for instance, on its velocity or the time. For example, the gravitational 
attraction of any fixed mass distribution is a field of force, but resistance forces, which are 
usually velocity dependent, are not. 

If the rectilinear motion takes place along the $z$-axis, the equation of motion (4.1) 
reduces to the scalar equation 

$$m\frac{dv}{dt} = F(z), \tag{4.2}$$


where v is the (one-dimensional) velocity of the particle and F(z) is the (one-dimensional) 
force field, both measured in the positive z -direction. 

First we consider the problem of vertical motion of a particle under uniform gravity 
with no air resistance. This is fine on the Moon (which has no atmosphere) but, on 
Earth, the motion of a body is resisted by its passage through the atmosphere and this will 
introduce errors. The effect of resistance forces is investigated in section 4.3. 
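
The two-stage method (first solve for the velocity, then for the position) can also be carried out numerically. The following is a minimal sketch, not a prescribed algorithm: it steps the scalar equation of motion forward in time with a simple trapezoidal update, which happens to be exact when the force is constant. The function name `integrate`, the particle mass, the projection speed and the use of the Moon's value of g are all assumptions made for illustration.

```python
def integrate(force, m, v0, z0, t_end, n=1000):
    """Step m dv/dt = F(z) and dz/dt = v from t = 0 to t = t_end in n equal steps."""
    dt = t_end / n
    v, z = v0, z0
    for _ in range(n):
        a = force(z) / m               # acceleration from the equation of motion
        v_new = v + a * dt             # update the velocity
        z += 0.5 * (v + v_new) * dt    # update the position using the mean velocity
        v = v_new
    return v, z

m = 2.0          # hypothetical particle mass in kg
g_moon = 1.6     # Moon's gravitational acceleration in m s^-2
u = 8.0          # hypothetical upward projection speed in m s^-1

def uniform_gravity(z):
    """Uniform gravity: a constant downward force, independent of position."""
    return -m * g_moon

v_end, z_end = integrate(uniform_gravity, m, u, 0.0, t_end=10.0)
print(v_end, z_end)
```

For constant acceleration the standard formulas $v = u - gt$ and $z = ut - \tfrac{1}{2}gt^2$ give $v = -8$ and $z = 0$ at $t = 10$ for these values, which the computed values reproduce up to rounding.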

Example 4.1 Vertical motion under uniform gravity 

A particle is projected vertically upwards with speed u and moves in a vertical straight 
line under uniform gravity with no air resistance. Find the maximum height achieved 
by the particle and the time taken for it to return to its starting point. 




with the initial condition $r = a$ when $t = 0$. After taking the positive square root of 
each side ($dr/dt > 0$ in this motion), the equation separates to give 

$$\int_a^{2a}\left(\frac{ar}{2a - r}\right)^{1/2} dr = (MG)^{1/2}\int_0^{\tau} dt.$$

(Here we have introduced the initial and final conditions directly into the limits of 
integration; $\tau$ is the elapsed time.) On simplifying, we obtain 

$$\tau = (MG)^{-1/2}\int_a^{2a}\left(\frac{ar}{2a - r}\right)^{1/2} dr.$$


This integral can be evaluated by making the substitution $r = 2a\sin^2\theta$; the details 
are unimportant. The result is that the time taken for $P$ to progress from $r = a$ to 
$r = 2a$ is 

$$\tau = \left(1 + \frac{\pi}{2}\right)\left(\frac{a^3}{MG}\right)^{1/2}.$$
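
The substitution can be checked numerically. The sketch below rests on an assumption inferred from the limits of integration and the substitution $r = 2a\sin^2\theta$: that $P$ comes to rest at $r = 2a$, so that energy conservation under inverse square attraction gives $(dr/dt)^2 = MG(2/r - 1/a)$. Under that assumption the integrand becomes $4a^{3/2}\sin^2\theta$ with $\theta$ running from $\pi/4$ to $\pi/2$. The function names and sample values are illustrative.

```python
import math

def travel_time(a, MG, n=10000):
    """Midpoint rule in theta after substituting r = 2 a sin^2(theta)."""
    lo, hi = math.pi / 4.0, math.pi / 2.0     # theta at r = a and at r = 2a
    h = (hi - lo) / n
    total = 0.0
    for k in range(n):
        th = lo + (k + 0.5) * h
        total += 4.0 * a**1.5 * math.sin(th) ** 2 * h
    return total / math.sqrt(MG)

def travel_time_closed(a, MG):
    """Closed form (1 + pi/2) (a^3 / MG)^(1/2) that the theta-integral evaluates to."""
    return (1.0 + math.pi / 2.0) * math.sqrt(a**3 / MG)

tau_num = travel_time(2.0, 3.0)
tau_closed = travel_time_closed(2.0, 3.0)
print(tau_num, tau_closed)
```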



Question Speed of escape from the Moon 

A body is projected vertically upwards from the surface of the Moon. What projection 
speed is necessary for the body to escape the Moon? 

Answer 

We regard the Moon as a fixed symmetric sphere of mass M and radius R. In this case, 
the gravitational force exerted by the Moon is the same as that of a particle of mass M 
situated at the centre. Thus the preceding theory applies with the distance a replaced 
by the radius $R$. The escape speed is therefore $(2MG/R)^{1/2}$, which evaluates to 
about 2.4 km s$^{-1}$. [For the Moon, $M = 7.35 \times 10^{22}$ kg and $R = 1740$ km.] ■ 
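
The numerical value quoted for the Moon can be reproduced directly. This is a minimal sketch; the function name `escape_speed` is an assumption, and the value of G is the approximate one given in (3.7).

```python
import math

G = 6.67e-11  # constant of gravitation in N m^2 kg^-2, approximate value (3.7)

def escape_speed(M, R):
    """Escape speed (2 M G / R)^(1/2) from the surface of a symmetric sphere."""
    return math.sqrt(2.0 * M * G / R)

v_moon = escape_speed(7.35e22, 1.74e6)   # Moon: mass in kg, radius in m
print(v_moon / 1000.0)                   # in km per second
```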

4.2 CONSTRAINED RECTILINEAR MOTION 

Figure 4.2 shows a uniform rigid rectangular block of mass M sliding down the 
inclined surface of a fixed rigid wedge of angle a. The initial conditions are supposed 
to be such that the block slides, without rotation, down the line of steepest slope of the 
wedge. The block is subject to uniform gravity, but it is clear that there must be other 
forces as well. If there were no other forces and the block were released from rest, then 
the block would move vertically downwards. However, solid bodies cannot pass through 
each other like ghosts, and interpenetration is prevented by (equal and opposite) forces 
that they exert upon each other. These are material contact forces which come into play 
only when bodies are in physical contact. They are examples of forces of constraint, 
which are not prescribed beforehand but are sufficient to enforce a specified geometrical 
constraint. Tradition has it that the constraint force that the wedge exerts on the block is 
FIGURE 4.2 A rigid rectangular block slides down the 
inclined surface of a fixed rigid wedge of angle $\alpha$. Note 
that $\mathbf{k}'$ is the vertically upwards unit vector, while i and k 
are parallel to and perpendicular to the inclined surface of 
the wedge. 

called the total reaction force R. It is convenient to write this force in the form 


R = -Fi + Nk, 


where the unit vectors i and k are parallel to and normal to the slope of the wedge. The 
scalar N is called the normal reaction component and the scalar F is called the frictional 
component.* 

The equation of motion of the block is the vector equation (4.1) which becomes 

$$M\frac{d(v\mathbf{i})}{dt} = -Mg\mathbf{k}' - F\mathbf{i} + N\mathbf{k},$$

where $\mathbf{k}'$ is the vertically upwards unit vector. The easiest way of proceeding is to take 
components of this vector equation in the i- and k-directions (the j-component gives 
nothing). On noting that $\mathbf{k}' = -\sin\alpha\,\mathbf{i} + \cos\alpha\,\mathbf{k}$, this gives 

$$M\frac{dv}{dt} = Mg\sin\alpha - F \qquad\text{and}\qquad 0 = N - Mg\cos\alpha.$$

The second of these equations determines the normal reaction $N = Mg\cos\alpha$. However, 
in the first equation, both v and F are unknown and this prevents any further progress 
in the solution of this problem.* One can proceed by proposing some empirical ‘law of 
friction’, but such laws hold only very roughly. It is not surprising then that, in much of 
mechanics, frictional forces are neglected. In this case, the total reaction force exerted by 
the surface is in the normal direction and we describe such surfaces as smooth, meaning
‘perfectly smooth’. Doing away with friction has the advantage of giving us a well-posed
problem that we can solve; however, the solution will then apply only approximately to
real surfaces.

* The minus sign is introduced so that F will be positive when the scalar velocity v is positive.

* This reflects the fact that we have said nothing about the roughness of the surface of the wedge!

FIGURE 4.3 The idealised string is depicted here as having a small circular cross-section.
At each cross-section only tensile stresses exist and their resultant is the tension T in the
string at that point.

If we now suppose that the inclined surface of the wedge is smooth, then $F = 0$ and
the first equation reduces to

\[ \frac{dv}{dt} = g\sin\alpha. \]

Thus, in the absence of friction, the block slides down the plane with the constant
acceleration $g\sin\alpha$.

Inextensible strings 

Another agency that can cause a geometrical constraint is the inextensible string. If a 
particle P of a system is connected to a fixed point O by an inextensible string of length 
a then, if the string is taut, P is constrained to move so that the distance OP = a. This
geometrical constraint is enforced by the (unknown) constraint force that the string applies 
to particle P. Our ‘string’ is an idealisation of real cords and ropes in that it is infinitely 
thin, has no bending stiffness, and is inextensible. The only force that one part of the string 
exerts on another is the tension T in the string, which acts parallel to the tangent vector t 
to the string at each point (see Figure 4.3). 

It is evident that, in general, $T$ varies from point to point along the string. Suppose
for example that a uniform string of mass $\rho$ per unit length is suspended vertically under
uniform gravity. Then, since the tension at the lower end is zero, the string will not be in
equilibrium unless the tension at a height $z$ above the lowest point is given by $T = \rho g z$;
the tension thus rises linearly with height.

The situation is simpler when the mass of the string is negligible; this is the case of 
the light inextensible string.* In this case, it is obvious that the tension is constant when 
the string is straight. In fact, the tension also remains constant when the string slides over 
a smooth body. This is proved in Chapter 10. The tension in a light string is also constant 
when the string passes over a light, smoothly pivoted pulley wheel. 


* In this context, ‘light’ means ‘of zero mass’.

FIGURE 4.4 Attwood’s machine: two bodies of masses $m$ and $M$ are connected by a light
inextensible string which passes over a smooth rail.

Example 4.3 Attwood’s machine

Two bodies with masses $m$, $M$ are connected by a light inextensible string which
passes over a smooth horizontal rail. The system moves in a vertical plane with the
bodies moving in vertical straight lines. Find the upward acceleration of the mass $m$
and the tension in the string.

Solution 

The system is shown in Figure 4.4. Let $v$ be the upward velocity of the mass $m$. Then,
since the string is inextensible, $v$ must also be the downward velocity of the mass $M$.
Also, since the string is light and the rail is smooth, the string has constant tension $T$.
The scalar equations of motion for the two masses are therefore

\[ m\frac{dv}{dt} = T - mg, \qquad M\frac{dv}{dt} = Mg - T. \]

It follows that

\[ \frac{dv}{dt} = \left(\frac{M-m}{M+m}\right) g, \qquad T = \left(\frac{2Mm}{M+m}\right) g. \]
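The two scalar equations of motion for the masses are linear in the unknowns $dv/dt$ and $T$, so they can be solved directly. The sketch below is only an illustration: the function name `atwood` and the default value of `g` are assumptions, not part of the text. It returns the upward acceleration of $m$ and the tension, and the trailing checks confirm that both equations of motion are satisfied.

```python
def atwood(m, M, g=9.81):
    """Solve  m*a = T - m*g  and  M*a = M*g - T  for (a, T).

    a is the upward acceleration of the mass m (and the downward
    acceleration of M); T is the constant tension in the light string.
    The name `atwood` and the default g are illustrative assumptions.
    """
    a = (M - m) * g / (M + m)        # add the two equations to eliminate T
    T = 2.0 * M * m * g / (M + m)    # substitute a back into either equation
    return a, T


# Example: m = 1 kg, M = 3 kg, with g rounded to 10 m/s^2 for easy arithmetic.
a, T = atwood(1.0, 3.0, g=10.0)
residual_m = 1.0 * a - (T - 1.0 * 10.0)   # equation of motion for m
residual_M = 3.0 * a - (3.0 * 10.0 - T)   # equation of motion for M
```

Note that when $m = M$ the acceleration vanishes and $T = mg$, as expected for a balanced machine.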



Question The monkey puzzle 

Suppose that, in the last example, both bodies have the same mass M and one of them 
is a monkey which begins to climb up the rope. What happens to the other mass? 

Answer 

Suppose that the monkey climbs with velocity $V$ relative to the rope. Then its upward
velocity relative to the ground is $V - v$. The equations of upward motion for the mass
and the monkey are therefore

\[ M\frac{dv}{dt} = T - Mg, \qquad M\frac{d(V-v)}{dt} = T - Mg. \]

On eliminating $T$, we find that

\[ \frac{dv}{dt} = \frac{1}{2}\frac{dV}{dt}, \]

so that, if the whole system starts from rest,

\[ v = \tfrac{1}{2}V, \quad\text{and}\quad V - v = \tfrac{1}{2}V. \]

Thus the monkey and the mass rise (relative to the ground) with the same velocity;
the monkey cannot avoid hauling up the mass as well as itself! ■

FIGURE 4.5 The drag D and lift L on a body moving through a fluid.


4.3 MOTION THROUGH A RESISTING MEDIUM 

The physics of fluid drag 

When a body moves through a fluid such as air or water, the fluid exerts forces on the surface 
of the body. This is because the body must push the fluid out of the way, and to do this the body 
must exert forces on the fluid. By the Third Law, the fluid must then exert equal and opposite 
forces on the body. A person wading through water or riding a motorcycle is well aware of 
the existence of such forces, which fall into the general category of material contact forces. 
We are interested in the resultant force that the fluid exerts on the body and it is convenient to
write this resultant in the form 


F = D + L, 

where the vector drag force D has the opposite direction to the velocity of the body, and the 
vector lift force L is at right angles to this velocity. The existence of lift makes air travel 
possible and is obviously very important. However, we will be concerned only with drag since 
we will restrict our attention to those cases in which the body is a rigid body of revolution 
which moves (without rotation) in the direction of its axis of symmetry. In this case, the lift is 
zero, by symmetry. We are then left with the scalar drag D , acting in the opposite direction to 
the velocity of the body. 

The theoretical determination of lift and drag forces is one of the great unsolved problems 
of hydrodynamics and most of the available data has been obtained by experiment. Even for
the case of a rigid sphere moving with constant velocity through an incompressible* fluid, a
general theoretical solution for the drag is not available. In this problem, the drag depends
on the radius $a$ and speed $V$ of the sphere, and the density $\rho$ and viscosity $\mu$ of the fluid.
Straightforward dimensional analysis shows that $D$ must have the form

\[ D = \rho a^2 V^2\, F\!\left(\frac{\rho V a}{\mu}\right), \]

where $F$ is a function of a single variable.

Definition 4.1 Reynolds number The dimensionless quantity $R = \rho V a/\mu$ is called the
Reynolds number.† It is more commonly written $R = Va/\nu$, where the quantity $\nu = \mu/\rho$ is
called the kinematic viscosity of the fluid.

The function F (R) has never been calculated theoretically, and experimental data must be 
used. It is a surprising fact that, for a wide range of values of $R$ (about $1000 < R < 100{,}000$),
the function $F$ is found to be roughly constant. Subject to this approximation, the formula for
the drag becomes

\[ D = C\rho a^2 V^2, \]

where the dimensionless constant $C$ is called the drag coefficient‡ for the sphere; its value is
about 0.8.

A similar formula holds (with a different value of C) for any body of revolution moving 
parallel to its axis. In this case a is the radius of the maximum cross sectional area of the body 
perpendicular to the direction of motion. For example, the drag coefficient for a circular disk 
moving at right angles to its own plane is about 1.7. We thus obtain the result that (subject 
to the conditions mentioned above) the drag is proportional to the square of the speed of the 
body through the fluid. This is the quadratic law of resistance. 

This result does not hold for low Reynolds numbers. This was shown theoretically by
Stokes§ in his analysis of the creeping flow of a fluid past a sphere. Stokes proved that, as
$R \to 0$, the function $F(R) \sim 6\pi/R$ so that the drag formula becomes

\[ D \sim 6\pi a \mu V. \]

On dimensional grounds, a similar formula (with a different coefficient) should hold for other
bodies of revolution. Thus at low Reynolds numbers¶ the drag is proportional to the speed of the
body through the fluid. This is the linear law of resistance.

Which (if either) of these laws is appropriate in any particular case depends on the 
Reynolds number. However, it is quickly apparent that the low Reynolds number condition 
requires quite special physical conditions, as the following example shows. 


* In this treatment, the effects of fluid compressibility are neglected. In practice, this means that the speed 
of the body must be well below the speed of sound in the fluid. 

† After the great English hydrodynamicist Osborne Reynolds (1842-1912). At the age of twenty-six he was
appointed to the University of Manchester’s first professorship of engineering.

‡ The drag coefficient $C_D$ used by aerodynamicists is $2C/\pi$.

§ George Gabriel Stokes (1819-1903), a major figure in British applied mathematics.

¶ Low means $R$ less than about 0.5.

Table 1 Some fluid properties relevant to drag calculations (Kaye & Laby [14]).

                      Density ρ      Kinematic viscosity ν    Sound speed
                      (kg m^-3)      (m^2 s^-1)               (m s^-1)
Air (20°C, 1 atm.)    1.20           1.50 × 10^-5             343
Water (20°C)          998            1.00 × 10^-6             1480
Castor oil (20°C)     950            1.04 × 10^-3             1420


Example 4.4 Which law of resistance? 

A stainless steel ball bearing of radius 1 mm is falling vertically with constant speed in 
air. Find the speed of the ball bearing. [The density of stainless steel is 7800 kg m$^{-3}$.]
If the medium were castor oil, what then would be the speed of the ball bearing? 

Solution 

Suppose the ball bearing is falling with constant speed V. (We will later call V the 
terminal speed of the ball bearing.) Then, since its acceleration is zero, the total of 
the forces acting upon it must also be zero. Thus 

mg + D = m'g, 

where $m'$ is the mass of the ball bearing, $m$ is the mass of the displaced fluid, and
$D$ is the drag. The term $m'g$ is the gravity force acting downwards and the term $mg$
is the (Archimedes) buoyancy force acting upwards.* (In air, the buoyancy force is
negligible.)

Hence, if the linear resistance law holds, then

\[ 6\pi a \rho\nu V = \tfrac{4}{3}\pi a^3 (\rho' - \rho)\, g, \]

where $a$ is the radius of the ball bearing, and $\rho'$, $\rho$ are the densities of the ball bearing
and air respectively. This gives

\[ V = \frac{2a^2 g}{9\nu}\left(\frac{\rho'}{\rho} - 1\right). \]

On using the numerical values given in Table 1 we obtain $V = 940$ m s$^{-1}$ with the
corresponding Reynolds number $R = 63{,}000$. Quite apart from the fact that the calculated
speed is nearly three times the speed of sound, this solution is disqualified on
the grounds that the Reynolds number is 100,000 times too large for the low Reynolds
number approximation to hold!

On the other hand, if the quadratic law of resistance holds then

\[ C\rho a^2 V^2 = \tfrac{4}{3}\pi a^3 (\rho' - \rho)\, g, \]

where $C$ is the drag coefficient for a sphere which we will take to be 0.8. In this case

\[ V^2 = \frac{4\pi a g}{3C}\left(\frac{\rho'}{\rho} - 1\right). \]

This gives the value $V = 19$ m s$^{-1}$ with the corresponding Reynolds number $R = 1250$.
This Reynolds number is nicely within the range in which the quadratic resistance
law is applicable, and so provides a consistent solution. Thus the answer is that,
in air, the ball bearing falls with a speed of 19 m s$^{-1}$.

When the medium is castor oil, a similar calculation shows that it is the linear
resistance law which provides the consistent solution. The answer is that, in castor
oil, the ball bearing falls with a speed of 1.5 cm s$^{-1}$, the Reynolds number being
0.015.

* It is not entirely obvious that the total force exerted by the fluid on the sphere is the sum of the drag and
buoyancy forces, but it is true for an incompressible fluid.

This example illustrates the conditions needed for low Reynolds number flow: 
slow motion of a small body through a sticky fluid. Perhaps the most celebrated 
application of the low Reynolds number drag formula is Millikan's oil drop method 
of determining the electronic charge (see Problem 4.20 at the end of the chapter). ■ 
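The self-consistency test used in Example 4.4 (compute the terminal speed under one law, then check whether the resulting Reynolds number lies in that law's range of validity) is easy to automate. The following is a minimal sketch under stated assumptions: the function names are hypothetical, $g = 9.81$ m s$^{-2}$ is assumed, the drag coefficient is taken as $C = 0.8$, and the validity ranges ($R < 0.5$ for the linear law, $1000 < R < 100{,}000$ for the quadratic law) are the rough figures quoted in the text.

```python
import math

G = 9.81  # m s^-2, assumed value of g


def terminal_speed_linear(a, rho_body, rho_fluid, nu, g=G):
    """Terminal speed from 6*pi*a*rho*nu*V = (4/3)*pi*a^3*(rho' - rho)*g."""
    return 2.0 * a**2 * g * (rho_body / rho_fluid - 1.0) / (9.0 * nu)


def terminal_speed_quadratic(a, rho_body, rho_fluid, C=0.8, g=G):
    """Terminal speed from C*rho*a^2*V^2 = (4/3)*pi*a^3*(rho' - rho)*g."""
    return math.sqrt(4.0 * math.pi * a * g * (rho_body / rho_fluid - 1.0) / (3.0 * C))


def reynolds(V, a, nu):
    """Reynolds number R = V*a/nu."""
    return V * a / nu


def consistent_terminal_speed(a, rho_body, rho_fluid, nu):
    """Return (law, V) for whichever resistance law is self-consistent."""
    V_lin = terminal_speed_linear(a, rho_body, rho_fluid, nu)
    if reynolds(V_lin, a, nu) < 0.5:
        return "linear", V_lin
    V_quad = terminal_speed_quadratic(a, rho_body, rho_fluid)
    if 1.0e3 < reynolds(V_quad, a, nu) < 1.0e5:
        return "quadratic", V_quad
    return "neither", None


# Steel ball bearing of radius 1 mm, with fluid data taken from Table 1.
law_air, V_air = consistent_terminal_speed(1e-3, 7800.0, 1.20, 1.50e-5)
law_oil, V_oil = consistent_terminal_speed(1e-3, 7800.0, 950.0, 1.04e-3)
```

With these inputs the sketch selects the quadratic law in air and the linear law in castor oil, in agreement with the example. The computed speeds come out close to the rounded values quoted above; small differences depend on the values assumed for $g$ and $C$.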


Example 4.5 Vertical motion under gravity with linear resistance 

A body is projected vertically upwards with speed $u$ in a medium that exerts a drag
force $-mK\mathbf{v}$, where $K$ is a positive constant and $\mathbf{v}$ is the velocity of the body.* Find
the maximum height achieved by the body, the time taken to reach that height, and
the terminal speed.

Solution 

On including the linear resistance force, the scalar equation of motion becomes

\[ m\frac{dv}{dt} = -mg - mKv, \]

with the initial condition $v = u$ when $t = 0$ (see Figure 4.1). This first order ODE for
$v$ separates in the form

\[ \int \frac{dv}{g + Kv} = -\int dt, \]

and, on integration, gives

\[ \frac{1}{K}\ln(g + Kv) = -t + C, \]

where $C$ is the integration constant. On applying the initial condition $v = u$ when
$t = 0$, we obtain $C = K^{-1}\ln(g + Ku)$ and so

\[ t = \frac{1}{K}\ln\left(\frac{g + Ku}{g + Kv}\right). \]

This expression gives $t$ in terms of $v$, which is what we need for finding the time taken
to reach the maximum height. The maximum height is achieved when $v = 0$ so that
$\tau$, the time taken to reach the maximum height, is given by

\[ \tau = \frac{1}{K}\ln\left(1 + \frac{Ku}{g}\right). \]

The expression for $t$ in terms of $v$ can be inverted to give

\[ v = u e^{-Kt} - \frac{g}{K}\left(1 - e^{-Kt}\right) \]

for the upward velocity of the body at time $t$.

The terminal speed of the body is the limit of $|v|$ as $t \to \infty$. In this limit, the
exponential terms tend to zero and

\[ v \to -\frac{g}{K}. \]

Thus, in contrast to motion with no resistance, the speed of the body does not increase
without limit as it falls, but tends to the finite value $g/K$. Thus the terminal speed of
the body is $g/K$.

The terminal speed can also be deduced directly from the equation of motion.
If the body is falling with the terminal speed, then $dv/dt = 0$ and the equation of
motion implies that $0 = -mg - mKv$. It follows that the (upward) terminal velocity
is $-g/K$.

* This is the vector drag force acting on the body; hence the minus sign. The coefficient is taken in the
form $mK$ for algebraic convenience.

The maximum height $z_{\max}$ can now be found by integrating the equation
$dz/dt = v$ and then putting $t = \tau$. However we can also obtain $z_{\max}$ by starting
again with a modified equation of motion. For some laws of resistance, this trick is
essential. If we write

\[ \frac{dv}{dt} = \frac{dv}{dz}\times\frac{dz}{dt} = v\frac{dv}{dz}, \]

the equation of motion becomes

\[ v\frac{dv}{dz} = -g - Kv, \]

with the initial condition $v = u$ when $z = 0$. This equation also separates to give

\[ z = -\frac{v}{K} + \frac{g}{K^2}\ln(g + Kv) + D, \]

where $D$ is the integration constant. On applying the initial condition $v = u$ when
$z = 0$, we obtain

\[ z = -\frac{v}{K} + \frac{g}{K^2}\ln(g + Kv) + \frac{u}{K} - \frac{g}{K^2}\ln(g + Ku)
     = \frac{1}{K}(u - v) - \frac{g}{K^2}\ln\left(\frac{g + Ku}{g + Kv}\right). \]

This expression cannot be inverted to give $v$ as a function of $z$, but it is exactly what
we need to find $z_{\max}$. Since $z_{\max}$ is achieved when $v = 0$, we find that the maximum
height achieved by the body is given by

\[ z_{\max} = \frac{u}{K} - \frac{g}{K^2}\ln\left(1 + \frac{Ku}{g}\right). \]

Question Approximate form of $z_{\max}$ for small $Ku/g$

Find an approximate expression for $z_{\max}$ when $Ku/g$ is small.

Answer

When $Ku/g$ is small, the log term can be expanded as a power series. This gives

\[ z_{\max} = \frac{u^2}{2g}\left[1 - \frac{2}{3}\left(\frac{Ku}{g}\right) + O\left(\left(\frac{Ku}{g}\right)^2\right)\right]. \]

In this expression, the leading term $u^2/2g$ is just the value of $z_{\max}$ in the absence
of resistance. The first correction term has a negative sign which means that $z_{\max}$ is
reduced by the presence of resistance, as would be expected. ■
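As a numerical check on Example 4.5, the closed-form results can be compared with the small-$Ku/g$ approximation. This is a sketch only: it assumes the standard closed forms $\tau = K^{-1}\ln(1 + Ku/g)$ and $z_{\max} = u/K - (g/K^2)\ln(1 + Ku/g)$ that follow from the separated equations with $v = 0$, together with the two-term series $z_{\max} \approx (u^2/2g)\left[1 - \tfrac{2}{3}(Ku/g)\right]$. The function names and the sample values of $u$ and $K$ are illustrative assumptions.

```python
import math


def time_to_max_height(u, K, g=9.81):
    """tau = (1/K) * ln(1 + K*u/g); log1p keeps accuracy when K*u/g is small."""
    return math.log1p(K * u / g) / K


def max_height(u, K, g=9.81):
    """z_max = u/K - (g/K^2) * ln(1 + K*u/g)."""
    return u / K - (g / K**2) * math.log1p(K * u / g)


def max_height_series(u, K, g=9.81):
    """Two-term approximation (u^2 / 2g) * [1 - (2/3) * (K*u/g)]."""
    lam = K * u / g
    return u**2 / (2.0 * g) * (1.0 - 2.0 * lam / 3.0)


# Illustrative values: u = 10 m/s and K = 0.01 s^-1, so K*u/g is about 0.01.
u, K = 10.0, 0.01
z_exact = max_height(u, K)
z_series = max_height_series(u, K)
z_no_drag = u**2 / (2.0 * 9.81)
tau = time_to_max_height(u, K)
```

For these values the exact and series heights agree to a relative error of order $(Ku/g)^2$, and both lie slightly below the resistance-free height $u^2/2g$.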

Question Ball bearing released in castor oil 

The ball bearing in Example 4.4 is released from rest in castor oil. How long does it 
take for the ball bearing to achieve 99% of its terminal speed? 

Answer 

Recall that the linear law of resistance is appropriate for this motion. Since the motion
is entirely downwards, it is more convenient to measure $v$ downwards in this problem,
in which case the solution for $v$ becomes

\[ v = V\left(1 - e^{-gt/V}\right), \]

where $V$ is the terminal velocity. When $v = 0.99V$, $e^{-gt/V} = 0.01$ and so the time
required is

\[ t = \ln(100)\,V/g, \]

which evaluates to about 7 milliseconds on using the value for $V$ calculated in
Example 4.4. ■

Note on the sign of resistance forces In the last example we used the same scalar equation of
motion whether the body was rising or falling. This is correct in the case of linear resistance
since, when the sign of $v$ is reversed, so is the sign of $Kv$. In the case of quadratic resistance
however, when the sign of $v$ is reversed, the sign of $Kv^2$ remains unchanged and so the correct
sign must be inserted manually. Thus, for quadratic resistance, the scalar equations of motion
for ascent and descent are different. The same is true when the drag is proportional to any
even power of $v$.

FIGURE 4.6 A particle, initially at the origin, is projected with speed $u$ in a direction
making an angle $\alpha$ with the horizontal. The particle moves under the uniform gravity
force $-mg\mathbf{k}$ and the resistance (drag) force $\mathbf{D}$.
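The note on the sign of resistance forces has a direct counterpart in numerical work. One common device, shown in the sketch below, is to write the quadratic drag per unit mass as $-Kv|v|$, which automatically opposes the motion for either sign of $v$. The function names are hypothetical, $v$ is measured upwards, and for the quadratic case $K$ is assumed to have the units of inverse length (drag force $mKv^2$).

```python
import math


def accel_linear(v, K, g=9.81):
    """Upward acceleration with linear drag: dv/dt = -g - K*v (valid for any sign of v)."""
    return -g - K * v


def accel_quadratic(v, K, g=9.81):
    """Upward acceleration with quadratic drag written as -K*v*|v|.

    For v > 0 (ascent) this is -g - K*v**2; for v < 0 (descent) it is
    -g + K*v**2, so the drag always opposes the velocity.
    """
    return -g - K * v * abs(v)


# Ascent and descent at the same speed give different accelerations.
up = accel_quadratic(2.0, 0.1, g=10.0)
down = accel_quadratic(-2.0, 0.1, g=10.0)

# At the downward terminal velocity -(g/K)**0.5 the acceleration vanishes.
terminal = accel_quadratic(-math.sqrt(10.0 / 0.1), 0.1, g=10.0)
```

Writing the drag this way means a single equation of motion can be integrated numerically through the top of the flight, without switching formulas by hand.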



4.4 PROJECTILES 

A body that moves freely under uniform gravity, and possibly air resistance, is called 
a projectile. Projectile motion is very common. In ball games, the ball is a projectile, and 
controlling its trajectory is a large part of the skill of the game. On a larger scale, artillery 
shells are projectiles, but guided missiles, which have rocket propulsion, are not. 

The projectile problem differs from the problems considered in section 4.3 in that projectile
motion is not restricted to take place in a vertical straight line. However, we will continue
to assume that the effect of the air is to exert a drag force opposing the current velocity of 
the projectile.* It is then evident by symmetry that each projectile motion takes place in a 
vertical plane; this vertical plane contains the initial position of the projectile and is parallel 
to its initial velocity. 

Projectiles without resistance 

The first (and easiest) problem is that of a projectile moving without air resistance. This is fine 
on the Moon, but will be only an approximation to projectile motion on Earth. The effect of 
air resistance can be very significant, as our later examples will show. 

Example 4.6 Projectile without air resistance 

A particle which is subject solely to uniform gravity is projected with speed $u$ in a
direction making an angle $\alpha$ with the horizontal. Find the subsequent motion.

Solution 

Suppose that the motion takes place in the $(x, z)$-plane as shown in Figure 4.6. In the
absence of the drag force, the vector equation of motion becomes

\[ m\frac{d\mathbf{v}}{dt} = -mg\,\mathbf{k}, \]

with the initial condition $\mathbf{v} = (u\cos\alpha)\mathbf{i} + (u\sin\alpha)\mathbf{k}$ when $t = 0$. If we now write
$\mathbf{v} = v_x\mathbf{i} + v_z\mathbf{k}$ and take components of this equation (and initial condition) in the
$\mathbf{i}$- and $\mathbf{k}$-directions, we obtain the two scalar equations of motion

\[ \frac{dv_x}{dt} = 0, \qquad \frac{dv_z}{dt} = -g, \]

with the respective initial conditions $v_x = u\cos\alpha$ and $v_z = u\sin\alpha$ when $t = 0$.
Simple integrations then give the components of the particle velocity to be

\[ v_x = u\cos\alpha, \qquad v_z = u\sin\alpha - gt. \]

The position of the particle at time $t$ can now be found by integrating the expressions
for $v_x$, $v_z$ and applying the initial conditions $x = 0$ and $z = 0$ when $t = 0$. This gives

\[ x = (u\cos\alpha)\,t, \qquad z = (u\sin\alpha)\,t - \tfrac{1}{2}gt^2, \]

the solution for the trajectory of the particle. ■

* This will be true if the projectile is a rigid sphere moving without rotation.


Question Form of the path 

Show that the path taken by the particle is an inverted parabola. 

Answer 

To find the path, eliminate $t$ from the trajectory equations. This gives

\[ z = (\tan\alpha)\,x - \left(\frac{g}{2u^2\cos^2\alpha}\right)x^2, \]

which is indeed an inverted parabola. ■

Question Time of flight and the range 

Find the time of flight and the range of the projectile on level ground. 

Answer 

On level ground, the motion will terminate when $z = 0$ again. From the second
trajectory equation, this happens when $(u\sin\alpha)\,t - \tfrac{1}{2}gt^2 = 0$. Hence the time of
flight $\tau$ is given by $\tau = 2u\sin\alpha/g$. The horizontal range $R$ is then obtained by
putting $t = \tau$ in the first trajectory equation, which gives

\[ R = \frac{u^2\sin 2\alpha}{g}. \] ■


Question Maximum range 

Find the value of $\alpha$ that gives the maximum range on level ground when $u$ is fixed.

Answer

$R$ is a maximum when $\sin 2\alpha = 1$, that is when $\alpha = \pi/4$ in which case $R_{\max} = u^2/g$.
Thus, if an artillery shell is to be projected over a horizontal range of 4 km, then the
gun must have a muzzle speed of at least 200 m s$^{-1}$. ■
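The resistance-free results (time of flight $2u\sin\alpha/g$, range $u^2\sin 2\alpha/g$, maximum range $u^2/g$) are simple enough to evaluate directly. The sketch below uses hypothetical function names and assumes $g = 9.81$ m s$^{-2}$; it reproduces the artillery estimate by inverting $R_{\max} = u^2/g$.

```python
import math


def time_of_flight(u, alpha, g=9.81):
    """Flight time on level ground: tau = 2*u*sin(alpha)/g."""
    return 2.0 * u * math.sin(alpha) / g


def level_range(u, alpha, g=9.81):
    """Range on level ground: R = u^2 * sin(2*alpha) / g."""
    return u**2 * math.sin(2.0 * alpha) / g


def min_muzzle_speed(R, g=9.81):
    """Smallest launch speed reaching range R, from R_max = u^2/g at alpha = pi/4."""
    return math.sqrt(g * R)


u_min = min_muzzle_speed(4000.0)               # just under 200 m/s for a 4 km range
R_check = level_range(u_min, math.pi / 4.0)    # recovers 4000 m

# Cross-check: the range is also x(tau) = (u cos alpha) * tau.
u, alpha = 30.0, math.radians(35.0)
R_direct = u * math.cos(alpha) * time_of_flight(u, alpha)
```

The cross-check at the end confirms that substituting the flight time into the horizontal trajectory equation gives the same range as the closed formula.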

There is a myriad of problems that can be found on the projectile with no air resistance, 
and some interesting examples are included at the end of the chapter. It should be noted
though that all these problems are dynamically equivalent to the problem solved above. Any
difficulties lie in the geometry!

Projectiles with resistance 

We now proceed to include the effect of air resistance. From our earlier discussion of fluid 
drag, it is evident that in most practical instances of projectile motion through the Earth’s 
atmosphere, it is the quadratic law of resistance that is appropriate. On the other hand, only 
the linear law of resistance gives rise to linear equations of motion and simple analytical 
solutions. This explains why mechanics textbooks contain extensive coverage of the linear 
case, even though this case is almost never appropriate in practice; the case that is appropriate 
cannot be solved! In the following example, we treat the linear resistance case. 


Example 4.7 Projectile with linear resistance 


A particle is subject to uniform gravity and the linear resistance force $-mK\mathbf{v}$, where
$K$ is a positive constant and $\mathbf{v}$ is the velocity of the particle. Initially the particle is
projected with speed $u$ in a direction making an angle $\alpha$ with the horizontal. Find the
subsequent motion.


Solution 

With the linear resistance term included, the vector equation of motion becomes

\[ m\frac{d\mathbf{v}}{dt} = -mK\mathbf{v} - mg\,\mathbf{k}, \]

with the initial condition $\mathbf{v} = (u\cos\alpha)\mathbf{i} + (u\sin\alpha)\mathbf{k}$ when $t = 0$. As in the last
example, this equation resolves into the two scalar equations of motion

\[ \frac{dv_x}{dt} + Kv_x = 0, \qquad \frac{dv_z}{dt} + Kv_z = -g, \]

with the respective initial conditions $v_x = u\cos\alpha$ and $v_z = u\sin\alpha$ when $t = 0$.
These first order ODEs are both separable and linear and can be solved by either
method; if they are regarded as linear, the integrating factor is $e^{Kt}$. The equations
integrate to give the components of the particle velocity to be

\[ v_x = (u\cos\alpha)\,e^{-Kt}, \qquad v_z = (u\sin\alpha)\,e^{-Kt} - \frac{g}{K}\left(1 - e^{-Kt}\right). \]

The position of the particle at time $t$ can now be found by integrating the expressions
for $v_x$, $v_z$ and applying the initial conditions $x = 0$ and $z = 0$ when $t = 0$. This gives

\[ x = \frac{u\cos\alpha}{K}\left(1 - e^{-Kt}\right), \qquad
   z = \frac{Ku\sin\alpha + g}{K^2}\left(1 - e^{-Kt}\right) - \frac{g}{K}\,t, \tag{4.3} \]


the solution for the trajectory of the particle. Figure 4.7 shows typical paths taken
by the particle for the same initial conditions and three different values of the dimensionless
resistance parameter $\lambda$ ($= Ku/g$). (The case $\lambda = 0$ corresponds to zero
resistance so that the path is a parabola.) It is apparent that resistance can have a
dramatic effect on the motion. ■

FIGURE 4.7 Projectile motion under uniform gravity and linear
resistance. The graphs show the paths of the particle for $\alpha = \pi/3$ and
three different values of the dimensionless resistance parameter $\lambda$
($= Ku/g$). Except when $\lambda = 0$, the paths have vertical asymptotes.
[Axes: $(u^2/g)^{-1}x$ horizontal, $(u^2/g)^{-1}z$ vertical.]


Question Vertical asymptote of the path 

Show that the path has a vertical asymptote. 

Answer 

Since $e^{-Kt}$ decreases and tends to zero as $t \to \infty$, it follows from equations (4.3) that
the horizontal displacement $x$ increases and tends to the value $u\cos\alpha/K$ as $t \to \infty$,
while the vertical displacement $z$ tends to negative infinity. Thus the vertical line
$x = u\cos\alpha/K$ is an asymptote to the path. In terms of the dimensionless variables
used in Figure 4.7, this is the line $(u^2/g)^{-1}x = \cos\alpha/\lambda$. ■
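The trajectory (4.3) and its vertical asymptote can be explored numerically. The sketch below assumes the standard closed form of the linear-resistance trajectory, $x = (u\cos\alpha/K)(1 - e^{-Kt})$ and $z = K^{-2}(Ku\sin\alpha + g)(1 - e^{-Kt}) - gt/K$; the function name `position` and the sample parameter values are illustrative assumptions. It checks that $x$ approaches $u\cos\alpha/K$ at late times and that the resistance-free parabola is recovered when $K$ is very small.

```python
import math


def position(t, u, alpha, K, g=9.81):
    """Position (x, z) at time t for a projectile with linear resistance.

    Uses 1 - exp(-K*t) computed as -expm1(-K*t), which stays accurate
    when K*t is small.
    """
    decay = -math.expm1(-K * t)
    x = u * math.cos(alpha) * decay / K
    z = (K * u * math.sin(alpha) + g) * decay / K**2 - g * t / K
    return x, z


u, alpha, K = 20.0, math.pi / 3.0, 0.5
asymptote = u * math.cos(alpha) / K          # vertical asymptote x = u cos(alpha)/K
x_late, z_late = position(100.0, u, alpha, K)

# With very weak resistance the path is close to the parabola of Example 4.6.
x_weak, z_weak = position(1.0, u, alpha, 1e-4)
x_free = u * math.cos(alpha) * 1.0
z_free = u * math.sin(alpha) * 1.0 - 0.5 * 9.81 * 1.0**2
```

Plotting `position` over a range of `t` for a few values of `K` would give curves of the kind described in the caption of Figure 4.7.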

Question Approximate formula for the range when $\lambda$ is small

Find an approximate formula for the range on level ground when the resistance parameter
$\lambda$ is small.

Answer 

Since the particle returns to Earth again when $z = 0$, it follows from the second of equations (4.3)
that the flight time $\tau$ satisfies the equation

\[ (Ku\sin\alpha + g)\left(1 - e^{-K\tau}\right) - Kg\tau = 0, \]

which can be written in the form

\[ (\lambda\sin\alpha + 1)\left(1 - e^{-K\tau}\right) - K\tau = 0, \tag{4.4} \]

where $\lambda$ ($= Ku/g$) is the dimensionless resistance parameter. Unfortunately, this equation cannot
be solved explicitly for $\tau$, and hence the need for an approximate solution. We know from the
last example that, in the absence of resistance, the flight time $\tau$ is given by $\tau = 2u\sin\alpha/g$. It is
reasonable then, when $\lambda$ is small, to seek a solution for $\tau$ in the form

\[ \tau = \frac{2u\sin\alpha}{g}\left[1 + b_1\lambda + b_2\lambda^2 + \cdots\right], \tag{4.5} \]

where the coefficients $b_1, b_2, \ldots$ are to be determined. To find the expansion coefficients we
substitute the expansion (4.5) (truncated after the required number of terms) into the left side of
equation (4.4), re-expand in powers of $\lambda$, and then set the coefficients in this expansion equal to
zero. The corresponding formula for the range can then be found by substituting this approximate
formula for $\tau$ into the first equation of (4.3) and re-expanding in powers of $\lambda$. The details are
tedious and, in fact, such operations are best done with computer assistance. The completion
of this solution is the subject of computer assisted Problem 4.34 at the end of this chapter. The
answer (to three terms) is that the range $R$ on level ground is given by

\[ R = R_0\left[1 - \frac{4}{3}\lambda\sin\alpha + \frac{14}{9}\lambda^2\sin^2\alpha + O\left(\lambda^3\right)\right], \]

where $R_0$ is the range when resistance is absent. Figure 4.8 compares two different approximations
to $R$ with the ‘exact’ value obtained by numerical solution of equation (4.4). As would be
expected, the three term approximation is closer to the exact value. ■

FIGURE 4.8 The ratio $R/R_0$ plotted against $\lambda\sin\alpha$. [Curves shown: 2-term approximation, 3-term approximation, numerical solution.]
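The comparison described for Figure 4.8 can be reproduced with a few lines of code. The sketch below solves equation (4.4) for $T = K\tau$ by bisection, writing it as $T/(1 - e^{-T}) = 1 + \lambda\sin\alpha$; the left side increases monotonically from 1, and the root lies in $0 < T \le 2\lambda\sin\alpha$. The ratio $R/R_0 = (1 - e^{-T})/(2\lambda\sin\alpha)$ then follows from the first of equations (4.3) and $R_0 = u^2\sin 2\alpha/g$. The series coefficients $-4/3$ and $14/9$ are those obtained by carrying out the expansion described above. All function names are hypothetical, and `eps` stands for $\lambda\sin\alpha$.

```python
import math


def g_ratio(T):
    """T / (1 - exp(-T)), with the limiting value 1 at T = 0."""
    return 1.0 if T == 0.0 else T / -math.expm1(-T)


def flight_time_K(eps, iterations=200):
    """Solve (eps + 1)*(1 - exp(-T)) - T = 0 for the nonzero root T = K*tau.

    eps = lambda * sin(alpha) > 0. Bisection on the monotone function
    g_ratio(T) - (1 + eps) over the bracket [0, 2*eps].
    """
    lo, hi = 0.0, 2.0 * eps
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if g_ratio(mid) < 1.0 + eps:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def range_ratio_numeric(eps):
    """R/R0 from the numerical flight time: (1 - exp(-T)) / (2*eps)."""
    T = flight_time_K(eps)
    return -math.expm1(-T) / (2.0 * eps)


def range_ratio_series(eps, terms=3):
    """Two- or three-term series 1 - (4/3)*eps + (14/9)*eps**2 for R/R0."""
    ratio = 1.0 - 4.0 * eps / 3.0
    if terms >= 3:
        ratio += 14.0 * eps**2 / 9.0
    return ratio


eps = 0.1
exact = range_ratio_numeric(eps)
err2 = abs(exact - range_ratio_series(eps, terms=2))
err3 = abs(exact - range_ratio_series(eps, terms=3))
```

At `eps = 0.1` the three-term error is roughly an order of magnitude smaller than the two-term error, consistent with the remark that the three term approximation is closer to the exact value.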


4.5 CIRCULAR MOTION 


In this section we examine some important problems in which a body moves on a 
circular path. Our first problem is concerned with a body executing a circular orbit under the 
gravitational attraction of a fixed mass. This is a fairly accurate model of the motion of the 
planets* around the Sun. 

Example 4.8 Circular orbit in the inverse square field 

A particle of mass $m$ moves under the gravitational attraction of a fixed mass $M$
situated at the origin. Show that circular orbits with centre $O$ and any radius are
possible, and find the speed of the particle in such an orbit. Deduce the period of the
orbit.

* The orbits of Mercury, Mars and Pluto are the most elliptical with eccentricities of 0.206, 0.093 and 0.249
respectively. The eccentricity of Earth’s orbit is 0.017.


Solution 

Note that we are not required to find the general orbit; we may assume from the start
that the orbit is a circle. Suppose then that the particle is executing a circular orbit
with centre $O$ and radius $R$. We need to confirm that the vector equation of motion
can be satisfied. Take polar coordinates $r$, $\theta$ with centre at $O$. Then the acceleration $\mathbf{a}$
of the particle is given in terms of the usual polar unit vectors by the formula (2.14),
that is,

\[ \mathbf{a} = \left(\ddot{r} - r\dot{\theta}^2\right)\hat{\mathbf{r}} + \left(r\ddot{\theta} + 2\dot{r}\dot{\theta}\right)\hat{\boldsymbol{\theta}}
   = -\frac{v^2}{R}\,\hat{\mathbf{r}} + \dot{v}\,\hat{\boldsymbol{\theta}} \]

for motion on the circle $r = R$, where the circumferential velocity $v = R\dot{\theta}$. The
equation of motion for the particle is therefore

\[ m\left[-\frac{v^2}{R}\,\hat{\mathbf{r}} + \dot{v}\,\hat{\boldsymbol{\theta}}\right] = -\frac{mMG}{R^2}\,\hat{\mathbf{r}}, \]

which, on taking components in the radial and transverse directions, gives

\[ \frac{v^2}{R} = \frac{MG}{R^2} \quad\text{and}\quad \dot{v} = 0. \]

Hence the equation of motion is satisfied if $v$ is a constant given by

\[ v^2 = \frac{MG}{R}. \]

Thus a circular orbit of radius $R$ is possible provided that the particle has constant
speed $(MG/R)^{1/2}$.

The period $\tau$ of the orbit is the time taken for one circuit and is given by

\[ \tau = \frac{2\pi R}{v} = \left(\frac{4\pi^2R^3}{MG}\right)^{1/2}. \]

Thus the square of the period of a circular orbit is proportional to the cube of its
radius. This is a special case of Kepler’s third law of planetary motion (see Chapter
7). ■
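As a rough numerical illustration of Example 4.8, the circular-orbit speed $(MG/R)^{1/2}$ and period $2\pi(R^3/MG)^{1/2}$ can be evaluated for an Earth-like orbit around the Sun. The constants below (gravitational constant, solar mass, mean Earth-Sun distance) are standard approximate values supplied here as assumptions, not data from the text, and the function names are hypothetical.

```python
import math

G_NEWTON = 6.674e-11   # m^3 kg^-1 s^-2, approximate gravitational constant (assumed)
M_SUN = 1.989e30       # kg, approximate solar mass (assumed)
R_EARTH_ORBIT = 1.496e11   # m, approximate mean Earth-Sun distance (assumed)


def circular_speed(M, R):
    """Speed in a circular orbit of radius R about a fixed mass M: sqrt(M*G/R)."""
    return math.sqrt(G_NEWTON * M / R)


def circular_period(M, R):
    """Period of the circular orbit: 2*pi*sqrt(R^3 / (M*G))."""
    return 2.0 * math.pi * math.sqrt(R**3 / (G_NEWTON * M))


v_earth = circular_speed(M_SUN, R_EARTH_ORBIT)                      # roughly 30 km/s
period_days = circular_period(M_SUN, R_EARTH_ORBIT) / 86400.0       # roughly one year

# Kepler scaling: quadrupling the radius multiplies the period by 4**1.5 = 8.
scaling = circular_period(M_SUN, 4.0 * R_EARTH_ORBIT) / circular_period(M_SUN, R_EARTH_ORBIT)
```

The period comes out close to a year, which is one indication that the circular model is a reasonable first approximation for Earth's orbit.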

A particle may move on a circular path because it is constrained to do so. The simplest 
and most important example of this is the simple pendulum, a mass suspended from a fixed 
point by a string. 


Example 4.9 The simple pendulum 

A particle P is suspended from a fixed point O by a light inextensible string of length 
b. P is subject to uniform gravity and moves in a vertical plane through O with the 
string taut. Find the equation of motion.

FIGURE 4.9 The simple pendulum

Solution 

The system is shown in Figure 4.9. Since the string is of fixed length $b$, the position
of $P$ is determined by the angle $\theta$ shown. The acceleration of $P$ can be expressed in
the polar form

\[ \mathbf{a} = -\left(b\dot{\theta}^2\right)\hat{\mathbf{r}} + \left(b\ddot{\theta}\right)\hat{\boldsymbol{\theta}}, \]

where $\hat{\mathbf{r}}$ and $\hat{\boldsymbol{\theta}}$ are the polar unit vectors shown in Figure 4.9.

$P$ moves under the uniform gravity force $-mg\mathbf{k}$ and the tension $T$ in the string
which acts in the direction $-\hat{\mathbf{r}}$. It should be noted that the tension $T$ is a force of
constraint and not known beforehand. The equation of motion is therefore

\[ m\left[-\left(b\dot{\theta}^2\right)\hat{\mathbf{r}} + \left(b\ddot{\theta}\right)\hat{\boldsymbol{\theta}}\right] = -mg\,\mathbf{k} - T\,\hat{\mathbf{r}}. \]

If we now take components of this equation in the radial and transverse directions we
obtain

\[ -mb\dot{\theta}^2 = mg\cos\theta - T, \qquad mb\ddot{\theta} = -mg\sin\theta. \]

The second of these equations is the effective equation of motion in terms of the
‘coordinate’ $\theta$, namely,

\[ \ddot{\theta} + \left(\frac{g}{b}\right)\sin\theta = 0, \tag{4.6} \]

while the first equation determines the unknown tension $T$ once $\theta(t)$ is known.

Equation (4.6) is the exact equation of motion for the simple pendulum.
Because of the presence of the term in $\sin\theta$, this second order ODE is non-linear
and cannot be solved by using the standard technique for linear ODEs with constant
coefficients. ■
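Although the non-linear pendulum equation $\ddot{\theta} + (g/b)\sin\theta = 0$ resists the standard linear techniques, it is straightforward to integrate numerically. The sketch below is one possible approach, not a method taken from the text: it releases the pendulum from rest at angle `alpha`, advances the state with the classical fourth-order Runge-Kutta scheme, and takes four times the instant at which $\theta$ first reaches zero as the period. The function name, step size and default parameter values are assumptions, and `alpha` must satisfy $0 < \alpha < \pi$.

```python
import math


def pendulum_period(alpha, b=1.0, g=9.81, dt=1e-4):
    """Estimate the period for release from rest at angle alpha (0 < alpha < pi).

    Integrates theta'' = -(g/b)*sin(theta) with fixed-step RK4 until theta
    first crosses zero (a quarter period), locating the crossing by linear
    interpolation within the final step.
    """
    w2 = g / b

    def deriv(theta, rate):
        # Returns (d theta/dt, d rate/dt) for the first-order system.
        return rate, -w2 * math.sin(theta)

    theta, rate, t = alpha, 0.0, 0.0
    while True:
        k1 = deriv(theta, rate)
        k2 = deriv(theta + 0.5 * dt * k1[0], rate + 0.5 * dt * k1[1])
        k3 = deriv(theta + 0.5 * dt * k2[0], rate + 0.5 * dt * k2[1])
        k4 = deriv(theta + dt * k3[0], rate + dt * k3[1])
        new_theta = theta + dt * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]) / 6.0
        new_rate = rate + dt * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]) / 6.0
        if new_theta <= 0.0:
            frac = theta / (theta - new_theta)   # fraction of the step to the crossing
            return 4.0 * (t + frac * dt)
        theta, rate, t = new_theta, new_rate, t + dt


# Ratio of the computed period to the small-amplitude value 2*pi*sqrt(b/g).
small_amplitude_period = 2.0 * math.pi * math.sqrt(1.0 / 9.81)
ratio_small = pendulum_period(0.01) / small_amplitude_period
ratio_large = pendulum_period(math.pi / 3.0) / small_amplitude_period
```

For a very small release angle the ratio is essentially 1, while for larger angles the computed period exceeds the small-amplitude value, showing that the period of the exact equation depends on amplitude.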

Question The linear theory for small amplitude oscillations 

Find an approximate linear equation for the case in which the pendulum undergoes 
oscillations of small amplitude.

Answer

If $\theta$ is always small then $\sin\theta$ can be approximated by $\theta$ in which case the equation
of motion becomes

\[ \ddot{\theta} + \left(\frac{g}{b}\right)\theta = 0. \tag{4.7} \]

This is the linearised equation for the simple pendulum, which holds approximately
for oscillations of small amplitude. Although we do not cover linear oscillations
until Chapter 5, many readers will recognise equation (4.7) as the simple harmonic
motion equation and will know that the period $\tau$ of the oscillations is given by
$\tau = 2\pi(b/g)^{1/2}$, independent of the (small) amplitude. ■


Question Period of large oscillations 

Find the period of the pendulum when the (angular) amplitude of its oscillations is
$\alpha$, where $\alpha$ may not be small.


Answer 

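This requires that we integrate the exact equation of motion (4.6). We start with a familiar trick.
If we write $\Omega = \dot{\theta}$, then

\[ \ddot{\theta} = \frac{d\Omega}{dt} = \frac{d\Omega}{d\theta}\times\frac{d\theta}{dt} = \Omega\frac{d\Omega}{d\theta} \]

and the equation of motion becomes

\[ \Omega\frac{d\Omega}{d\theta} = -\left(\frac{g}{b}\right)\sin\theta. \]

This is a separable first order ODE for $\Omega$ which integrates to give

\[ \tfrac{1}{2}\Omega^2 = \left(\frac{g}{b}\right)\cos\theta + C, \]

where $C$ is the constant of integration. On applying the initial condition $\Omega = 0$ when $\theta = \alpha$, we
find that $C = -(g/b)\cos\alpha$ and the integrated equation can be written

\[ \left(\frac{d\theta}{dt}\right)^2 = \left(\frac{2g}{b}\right)(\cos\theta - \cos\alpha), \tag{4.8} \]

where we have now replaced $\Omega$ by $d\theta/dt$.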

Since the pendulum comes to rest only when $d\theta/dt = 0$ (that is, when $\theta = \pm\alpha$) it follows
that $\theta$ must oscillate in the range $-\alpha \le \theta \le \alpha$. The period $\tau$ is the time taken for one complete
oscillation but, by the symmetry of equation (4.8) under the transformation $\theta \to -\theta$, it follows
that the time taken for the pendulum to swing from $\theta = 0$ to $\theta = +\alpha$ is $\tau/4$. To evaluate this time
we take the positive square root of each side of equation (4.8) and integrate over the time interval
$0 \le t \le \tau/4$. This gives

\[ \int_0^{\alpha} \frac{d\theta}{(\cos\theta - \cos\alpha)^{1/2}} = \left(\frac{2g}{b}\right)^{1/2}\int_0^{\tau/4} dt \]

so that

\[ \tau = \left(\frac{8b}{g}\right)^{1/2}\int_0^{\alpha} \frac{d\theta}{(\cos\theta - \cos\alpha)^{1/2}}. \tag{4.9} \]

This is the exact period of the pendulum when the amplitude of its oscillations is $\alpha$. It is not
possible to perform this integration in terms of standard functions* and so the integral must either
be evaluated numerically or be approximated.

Numerical evaluation shows that the exact period is longer than that predicted by the linearised
theory. When $\alpha = \pi/6$, the period is 1.7% longer, and when $\alpha = \pi/3$ it is 7.3% longer.
The period can also be approximated by expanding the integral in equation (4.9) as a power series
in $\alpha$. This is the subject of Problem 4.35 at the end of the chapter. The answer is that, expanded
to two terms,

\[ \tau = 2\pi\left(\frac{b}{g}\right)^{1/2}\left[1 + \frac{\alpha^2}{16} + O\left(\alpha^4\right)\right]. \tag{4.10} \]

This two term approximation predicts an increase in the period of 1.7% when $\alpha = \pi/6$. Note
that there is no term in this expansion proportional to $\alpha$ and that the term in $\alpha^2$ has the small
coefficient 1/16. This explains why the prediction of the linearised theory is rather accurate even
when $\alpha$ is not so small! ■

In our final example, we solve the important problem of an electrically charged particle
moving in a uniform magnetic field. It turns out that plane motions are circular, but the most
general motion is helical. The solution in this case differs from the previous examples in that
we use Cartesian coordinates instead of polars. This is because we do not know beforehand
where the centre of the circle (or the axis of the helix) is, which means that we do not know on
which point (or axis) to centre the polar coordinates.


Example 4.10 Charged particle in a magnetic field 

A particle of mass m and charge e moves in a uniform magnetic field of strength $B_0$.
Show that the most general motion is helical with the axis of the helix parallel to the
direction of the magnetic field.

Solution 

The total force $\mathbf{F}$ that an electric field $\mathbf{E}$ and magnetic field $\mathbf{B}$ exert on a charge $e$ is
given by the Lorentz force formula†

$$\mathbf{F} = e\mathbf{E} + e\,\mathbf{v} \times \mathbf{B},$$

where $\mathbf{v}$ is the velocity of the charge. In our problem, there is no electric field and
the magnetic field is uniform. If the direction of $\mathbf{B}$ is the z-direction of Cartesian
coordinates, then $\mathbf{B} = B_0\mathbf{k}$. The equation of motion of the particle is then

$$m\frac{d\mathbf{v}}{dt} = eB_0\,\mathbf{v} \times \mathbf{k}.$$

If we now write $\mathbf{v}$ in the component form $\mathbf{v} = v_x\mathbf{i} + v_y\mathbf{j} + v_z\mathbf{k}$, the vector equation
of motion resolves into the three scalar equations

$$\frac{dv_x}{dt} = \Omega v_y, \qquad \frac{dv_y}{dt} = -\Omega v_x, \qquad \frac{dv_z}{dt} = 0, \tag{4.11}$$

where

$$\Omega = eB_0/m. \tag{4.12}$$


* The integral is related to a special function called the complete elliptic integral of the first kind.
† This form is correct in SI units.


The last of these equations shows that $v_z = V$, a constant, so that the component of
$\mathbf{v}$ parallel to the magnetic field is a constant. The first two equations are first order
coupled ODEs but they are easy to uncouple. If we differentiate the first equation
with respect to $t$ and use the second equation, we find that $v_x$ satisfies the equation

$$\frac{d^2 v_x}{dt^2} + \Omega^2 v_x = 0.$$

This equation is a second order ODE with constant coefficients and can be solved in
the standard way. However, many readers will recognise this as the SHM equation
whose general solution can be written in the form

$$v_x = A\sin(\Omega t + \alpha),$$

where $A$ and $\alpha$ are arbitrary constants. It is more convenient if we introduce a new
arbitrary constant $R$ defined by $A = -\Omega R$, so that

$$v_x = -\Omega R\sin(\Omega t + \alpha).$$

If we now substitute this formula for $v_x$ into the first equation of (4.11), we obtain

$$v_y = -\Omega R\cos(\Omega t + \alpha).$$

Having obtained the solution for the three components of $\mathbf{v}$, we can now find the
trajectory simply by integrating with respect to $t$. This gives

$$x = R\cos(\Omega t + \alpha) + a, \qquad y = -R\sin(\Omega t + \alpha) + b, \qquad z = Vt + c,$$

where $a$, $b$, and $c$ are constants of integration. These constants may be removed by
a shift of the origin of coordinates to the point $(a, b, c)$, and the constant $\alpha$ may be
removed by a shift in the origin of $t$. Also, the constant $R$ may be assumed positive;
if it is not, make a shift in the origin of $t$ by $\pi/\Omega$. With these simplifications, the final
form for the trajectory is

$$x = R\cos\Omega t, \qquad y = -R\sin\Omega t, \qquad z = Vt, \tag{4.13}$$

where $R$ is a positive constant and $\Omega = eB_0/m$. This is the most general trajectory
for a charged particle moving in a uniform magnetic field.

To identify this trajectory as a helix, suppose first that $V = 0$ so that the motion
takes place in the $(x, y)$-plane. Then the first two equations of (4.13) imply that
the path is a circle of radius $R$ traversed with constant speed $R|\Omega|$ and with period
$2\pi/|\Omega|$. When $V \neq 0$, this circular motion is supplemented by a uniform velocity $V$
in the z-direction. The result is a helical path of radius $R$, with its axis parallel to the
magnetic field, which is traversed with constant speed $(V^2 + R^2\Omega^2)^{1/2}$. ■
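
As a numerical cross-check of the helical trajectory, the sketch below integrates the three component equations of motion ($dv_x/dt = \Omega v_y$, $dv_y/dt = -\Omega v_x$, $dv_z/dt = 0$) with a classical fourth-order Runge-Kutta step and compares the result with the closed form $x = R\cos\Omega t$, $y = -R\sin\Omega t$, $z = Vt$. It is an illustration only: the function names, the step size and the sample values of $R$, $\Omega$ and $V$ are assumptions made for this sketch and are not part of the example.

```python
import math


def lorentz_rhs(state, omega):
    """Time derivative of (x, y, z, vx, vy, vz) for a uniform field along z."""
    x, y, z, vx, vy, vz = state
    return (vx, vy, vz, omega * vy, -omega * vx, 0.0)


def rk4_step(state, dt, omega):
    """Advance the state by one classical fourth-order Runge-Kutta step."""
    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))

    k1 = lorentz_rhs(state, omega)
    k2 = lorentz_rhs(shifted(state, k1, dt / 2), omega)
    k3 = lorentz_rhs(shifted(state, k2, dt / 2), omega)
    k4 = lorentz_rhs(shifted(state, k3, dt), omega)
    return tuple(
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def helix(t, R, omega, V):
    """Closed-form trajectory x = R cos(omega t), y = -R sin(omega t), z = V t."""
    return (R * math.cos(omega * t), -R * math.sin(omega * t), V * t)


def max_deviation(R=1.0, omega=2.0, V=0.5, t_end=10.0, n_steps=10000):
    """Largest distance between the numerical and closed-form positions."""
    # Initial position and velocity read off from the closed form at t = 0.
    state = (R, 0.0, 0.0, 0.0, -R * omega, V)
    dt = t_end / n_steps
    worst = 0.0
    for i in range(1, n_steps + 1):
        state = rk4_step(state, dt, omega)
        exact = helix(i * dt, R, omega, V)
        err = math.sqrt(sum((p - q) ** 2 for p, q in zip(state[:3], exact)))
        worst = max(worst, err)
    return worst
```

Calling `max_deviation()` should return a number that is tiny compared with the orbit radius. Passing a negative `omega` (a charge of the opposite sign) reverses the sense of rotation but should leave the agreement unchanged.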

The above problem has important applications to the cyclotron particle accelerator and
the mass spectrograph. The cyclotron depends for its operation on the fact that the constant
$\Omega$, known as the cyclotron frequency, is independent of the velocity of the charged particles.
The mass spectrograph is the subject of Problem 4.32 at the end of the chapter.

Problems on Chapter 4 

Answers and comments are at the end of the book. 

Harder problems carry a star (*). 

Introductory problems 

4.1 Two identical blocks each of mass M are connected by a light inextensible string and can
move on the surface of a rough horizontal table. The blocks are being towed at constant speed
in a straight line by a rope attached to one of them. The tension in the tow rope is $T_0$. What is
the tension in the connecting string? The tension in the tow rope is suddenly increased to $4T_0$.
What is the instantaneous acceleration of the blocks and what is the instantaneous tension in
the connecting string?

4.2 A body of mass M is suspended from a fixed point O by an inextensible uniform rope
of mass m and length b. Find the tension in the rope at a distance z below O. The point of
support now begins to rise with acceleration 2g. What now is the tension in the rope?

4.3 Two uniform lead spheres each have mass 5000 kg and radius 47 cm. They are released
from rest with their centres 1 m apart and move under their mutual gravitation. Show that they
will collide in less than 425 s. [$G = 6.67 \times 10^{-11}$ N m$^2$ kg$^{-2}$.]

4.4 The block in Figure 4.2 is sliding down the inclined surface of a fixed wedge. This time
the frictional force F exerted on the block is given by $F = \mu N$, where N is the normal reaction
and $\mu$ is a positive constant. Find the acceleration of the block. How do the cases $\mu < \tan\alpha$
and $\mu > \tan\alpha$ differ?

4.5 A stuntwoman is to be fired from a cannon and projected a distance of 40 m over level
ground. What is the least projection speed that can be used? If the barrel of the cannon is
5 m long, show that she will experience an acceleration of at least 4g in the barrel. [Take
$g = 10$ m s$^{-2}$.]

4.6 In an air show, a pilot is to execute a circular loop at the speed of sound (340 m s$^{-1}$). The
pilot may black out if his acceleration exceeds 8g. Find the radius of the smallest circle he can
use. [Take $g = 10$ m s$^{-2}$.]

4.7 A body has terminal speed V when falling in still air. What is its terminal velocity (relative
to the ground) when falling in a steady horizontal wind with speed U?

4.8 Cathode ray tube A particle of mass m and charge e is moving along the x-axis with
speed u when it passes between two charged parallel plates. The plates generate a uniform
electric field $E_0\mathbf{j}$ in the region $0 \le x \le b$ and no field elsewhere.* Find the angle through
which the particle is deflected by its passage between the plates. [The cathode ray tube uses
this arrangement to deflect the electron beam.]


* This is only approximately true.

Straight line motion in a force field 

4.9 An object is dropped from the top of a building and is in view for time $\tau$ while passing a
window of height h some distance lower down. How high is the top of the building above the
top of the window?

4.10 A particle P of mass m moves under the gravitational attraction of a mass M fixed at
the origin O. Initially P is at a distance a from O when it is projected with the critical escape
speed $(2MG/a)^{1/2}$ directly away from O. Find the distance of P from O at time t, and
confirm that P escapes to infinity.

4.11 A particle P of mass m is attracted towards a fixed origin O by a force of magnitude
$m\gamma/r^3$, where r is the distance of P from O and $\gamma$ is a positive constant. [It’s gravity Jim,
but not as we know it.] Initially, P is at a distance a from O, and is projected with speed u
directly away from O. Show that P will escape to infinity if $u^2 > \gamma/a^2$.

For the case in which $u^2 = \gamma/(2a^2)$, show that the maximum distance from O achieved
by P in the subsequent motion is $\sqrt{2}\,a$, and find the time taken to reach this distance.

4.12 If the Earth were suddenly stopped in its orbit, how long would it take for it to collide
with the Sun? [Regard the Sun as a fixed point mass. You may make use of the formula for the
period of the Earth’s orbit.]

Constrained motion 

4.13 A particle P of mass m slides on a smooth horizontal table. P is connected to a second
particle Q of mass M by a light inextensible string which passes through a small smooth hole
O in the table, so that Q hangs below the table while P moves on top. Investigate motions
of this system in which Q remains at rest vertically below O, while P describes a circle with
centre O and radius b. Show that this is possible provided that P moves with constant speed
u, where $u^2 = Mgb/m$.

4.14 A light pulley can rotate freely about its axis of symmetry which is fixed in a horizontal
position. A light inextensible string passes over the pulley. At one end the string carries a mass
4m, while the other end supports a second light pulley. A second string passes over this pulley
and carries masses m and 4m at its ends. The whole system undergoes planar motion with the
masses moving vertically. Find the acceleration of each of the masses.

4.15 A particle P of mass m can slide along a smooth rigid straight wire. The wire has one of
its points fixed at the origin O, and is made to rotate in the (x, y)-plane with angular speed $\Omega$.
By using the vector equation of motion of P in polar co-ordinates, show that r, the distance of
P from O, satisfies the equation

$$\ddot{r} - \Omega^2 r = 0,$$

and find a second equation involving N, where $N\hat{\boldsymbol{\theta}}$ is the force the wire exerts on P. [Ignore
gravity in this question.]

Initially, P is at rest (relative to the wire) at a distance a from O. Find r as a function of t
in the subsequent motion, and deduce the corresponding formula for N.


Resisted motion 

4.16 A body of mass m is projected with speed u in a medium that exerts a resistance force
of magnitude (i) $mk|\mathbf{v}|$, or (ii) $mK|\mathbf{v}|^2$, where k and K are positive constants and $\mathbf{v}$ is the
velocity of the body. Gravity can be ignored. Determine the subsequent motion in each case.
Verify that the motion is bounded in case (i), but not in case (ii).

4.17 A body is projected vertically upwards with speed u and moves under uniform gravity
in a medium that exerts a resistance force proportional to the square of its speed and in which
the body’s terminal speed is V. Find the maximum height above the starting point attained by
the body and the time taken to reach that height.

Show also that the speed of the body when it returns to its starting point is
$uV/(V^2 + u^2)^{1/2}$. [Hint. The equations of motion for ascent and descent are different. See the note at the
end of section 4.3.]

4.18* A body is released from rest and moves under uniform gravity in a medium that exerts
a resistance force proportional to the square of its speed and in which the body’s terminal speed
is V. Show that the time taken for the body to fall a distance h is

$$\frac{V}{g}\cosh^{-1}\left(e^{gh/V^2}\right).$$

In his famous (but probably apocryphal) experiment, Galileo dropped different objects
from the top of the tower of Pisa and timed how long they took to reach the ground. If Galileo
had dropped two iron balls, of 5 mm and 5 cm radius respectively, from a height of 25 m, what
would the descent times have been? Is it likely that this difference could have been detected?
[Use the quadratic law of resistance with C = 0.8. The density of iron is 7500 kg m$^{-3}$.]

4.19 A body is projected vertically upwards with speed u and moves under uniform gravity in
a medium that exerts a resistance force proportional to the fourth power of its speed and in which
the body’s terminal speed is V. Find the maximum height above the starting point attained by
the body.

Deduce that, however large u may be, this maximum height is always less than $\pi V^2/4g$.

4.20 Millikan’s experiment A microscopic spherical oil droplet, of density $\rho$ and unknown
radius, carries an unknown electric charge. The droplet is observed to have terminal speed $v_1$
when falling vertically in air of viscosity $\mu$. When a uniform electric field $E_0$ is applied in the
vertically upwards direction, the same droplet was observed to move upwards with terminal
speed $v_2$. Find the charge on the droplet. [Use the low Reynolds number approximation for
the drag.]

Projectiles

4.21 A mortar gun, with a maximum range of 40 m on level ground, is placed on the edge of
a vertical cliff of height 20 m overlooking a horizontal plain. Show that the horizontal range
R of the mortar gun is given by

$$R = 40\left[\sin\alpha + \left(1 + \sin^2\alpha\right)^{1/2}\right]\cos\alpha,$$

where $\alpha$ is the angle of elevation of the mortar above the horizontal. [Take $g = 10$ m s$^{-2}$.]
Evaluate R (to the nearest metre) when $\alpha = 45°$ and $35°$ and confirm that $\alpha = 45°$ does
not yield the maximum range. [Do not try to find the optimum projection angle this way. See
Problem 4.22 below.]


4.22 It is required to project a body from a point on level ground in such a way as to clear a
thin vertical barrier of height h placed at distance a from the point of projection. Show that
the body will just skim the top of the barrier if

$$\tan^2\alpha - \left(\frac{2u^2}{ag}\right)\tan\alpha + \left(\frac{2u^2 h}{a^2 g} + 1\right) = 0,$$

where u is the speed of projection and $\alpha$ is the angle of projection above the horizontal.
Deduce that, if the above trajectory is to exist for some $\alpha$, then u must satisfy

$$u^4 - 2ghu^2 - g^2a^2 \ge 0.$$

Find the least value of u that satisfies this inequality.

For the special case in which $a = \sqrt{3}\,h$, show that the minimum projection speed necessary
to clear the barrier is $(3gh)^{1/2}$, and find the projection angle that must be used.

4.23 A particle is projected from the origin with speed u in a direction making an angle $\alpha$
with the horizontal. The motion takes place in the (x, z)-plane, where Oz points vertically
upwards. If the projection speed u is fixed, show that the particle can be made to pass through
the point (a, b) for some choice of $\alpha$ if (a, b) lies below the parabola

$$z = \frac{u^2}{2g} - \frac{g x^2}{2u^2}.$$

This is called the parabola of safety. Points above the parabola are ‘safe’ from the projectile.

An artillery shell explodes on the ground throwing shrapnel in all directions with speeds
of up to 30 m s$^{-1}$. A man is standing at an open window 20 m above the ground in a building
60 m from the blast. Is he safe? [Take $g = 10$ m s$^{-2}$.]

4.24 A projectile is fired from the top of a conical mound of height h and base radius a. What
is the least projection speed that will allow the projectile to clear the mound? [Hint. Make use
of the parabola of safety.]

A mortar gun is placed on the summit of a conical hill of height 60 m and base diameter
160 m. If the gun has a muzzle speed of 25 m s$^{-1}$, can it shell anywhere on the hill? [Take
$g = 10$ m s$^{-2}$.]





4.25 An artillery gun is located on a plane surface inclined at an angle $\beta$ to the horizontal.
The gun is aligned with the line of steepest slope of the plane. The gun fires a shell with speed
u in the direction making an angle $\alpha$ with the (upward) line of steepest slope. Find where the
shell lands.

Deduce the maximum ranges $R^U$, $R^D$, up and down the plane, and show that

$$\frac{R^U}{R^D} = \frac{1 - \sin\beta}{1 + \sin\beta}.$$

4.26 Show that, when a particle is projected from the origin in a medium that exerts linear
resistance, its position vector at time t has the general form

$$\mathbf{r} = -\alpha(t)\mathbf{k} + \beta(t)\mathbf{u},$$

where $\mathbf{k}$ is the vertically upwards unit vector and $\mathbf{u}$ is the velocity of projection. Deduce the
following results:

(i) A number of particles are projected simultaneously from the same point, with the
same speed, but in different directions. Show that, at each later time, the particles all
lie on the surface of a sphere.

(ii) A number of particles are projected simultaneously from the same point, in the same
direction, but with different speeds. Show that, at each later time, the particles all lie
on a straight line.

(iii) Three particles are projected simultaneously in a completely general manner. Show
that the plane containing the three particles remains parallel to some fixed plane.

4.27 A body is projected in a steady horizontal wind and moves under uniform gravity and
linear air resistance. Show that the influence of the wind is the same as if the magnitude and
direction of gravity were altered. Deduce that it is possible for the body to return to its starting
point. What is the shape of the path in this case?

Circular motion and charged particles 

4.28 The radius of the Moon’s approximately circular orbit is 384,000 km and its period is
27.3 days. Estimate the mass of the Earth. [$G = 6.67 \times 10^{-11}$ N m$^2$ kg$^{-2}$.] The actual mass
is $5.97 \times 10^{24}$ kg. What is the main reason for the error in your estimate?

An artificial satellite is to be placed in a circular orbit around the Earth so as to be
‘geostationary’. What must the radius of its orbit be? [The period of the Earth’s rotation is 23 h 56
m, not 24 h. Why?]

4.29 Conical pendulum A particle is suspended from a fixed point by a light inextensible
string of length a. Investigate ‘conical motions’ of this pendulum in which the string maintains
a constant angle $\alpha$ with the downward vertical. Show that, for any acute angle $\alpha$, a conical
motion exists and that the particle speed u is given by $u^2 = ag\sin\alpha\tan\alpha$.

4.30 A particle of mass m is attached to the highest point of a smooth rigid sphere of radius
a by a light inextensible string of length $\pi a/4$. The particle moves in contact with the outer
surface of the sphere, with the string taut, and describes a horizontal circle with constant
speed u. Find the reaction of the sphere on the particle and the tension in the string. Deduce
the maximum value of u for which such a motion could take place. What will happen if u
exceeds this value?

4.31 A particle of mass m can move on a rough horizontal table and is attached to a fixed
point on the table by a light inextensible string of length b. The resistance force exerted on
the particle is $-mK\mathbf{v}$, where $\mathbf{v}$ is the velocity of the particle. Initially the string is taut and
the particle is projected horizontally, at right angles to the string, with speed u. Find the angle
turned through by the string before the particle comes to rest. Find also the tension in the string
at time t.

4.32 Mass spectrograph A stream of particles of various masses, all carrying the same
charge e, is moving along the x-axis in the positive x-direction. When the particles reach the
origin they encounter an electronic ‘gate’ which allows only those particles with a specified
speed V to pass. These particles then move in a uniform magnetic field $B_0$ acting in the
z-direction. Show that each particle will execute a semicircle before meeting the y-axis at a
point which depends upon its mass. [This provides a method for determining the masses of
the particles.]

4.33 The magnetron An electron of mass m and charge $-e$ is moving under the combined
influence of a uniform electric field $E_0\mathbf{j}$ and a uniform magnetic field $B_0\mathbf{k}$. Initially the
electron is at the origin and is moving with velocity $u\mathbf{i}$. Show that the trajectory of the electron
is given by

$$x = a(\Omega t) + b\sin\Omega t, \qquad y = b(1 - \cos\Omega t), \qquad z = 0,$$

where $\Omega = eB_0/m$, $a = E_0/\Omega B_0$ and $b = (uB_0 - E_0)/\Omega B_0$. Use computer assistance to plot
typical paths of the electron for the cases $a < b$, $a = b$ and $a > b$. [The general path is called a
trochoid, which becomes a cycloid in the special case $a = b$. Cycloidal motion of electrons is
used in the magnetron vacuum tube, which generates the microwaves in a microwave oven.]

Computer assisted problems 

4.34 Complete Example 4.7 on the projectile with linear resistance by obtaining the quoted 
asymptotic formula for the range of the projectile. 

4.35 Find a series approximation for the period of the simple pendulum, in powers of the
angular amplitude $\alpha$. Proceed as follows:

The exact period $\tau$ of the pendulum was found in Example 4.9 and is given by the integral (4.9). This
integral is not suitable for expansion as it stands. However, if we write $\cos\theta - \cos\alpha = 2(\sin^2(\alpha/2) - \sin^2(\theta/2))$
and make the sneaky substitution $\sin(\theta/2) = \sin(\alpha/2)\sin\phi$, the formula for $\tau$ becomes

$$\tau = 4\left(\frac{b}{g}\right)^{1/2}\int_0^{\pi/2}\frac{d\phi}{(1 - \epsilon^2\sin^2\phi)^{1/2}},$$

where $\epsilon = \sin(\alpha/2)$. This new integrand is easy to expand as a power series in the variable $\epsilon$ and the limits
of integration are now constants. Use computer assistance to expand the integrand to the required number
of terms and then integrate term by term over the interval $[0, \pi/2]$. Finally re-expand as a power series
in the variable $\alpha$. The answer to two terms is given by equation (4.10), but it is just as easy to obtain any
number of terms.
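
The substituted form of the integral is also convenient for direct numerical evaluation, which gives a check on the percentages quoted in the text. The following is a minimal sketch, assuming the substituted form $\tau = 4(b/g)^{1/2}\int_0^{\pi/2}(1 - \epsilon^2\sin^2\phi)^{-1/2}\,d\phi$ with $\epsilon = \sin(\alpha/2)$. It uses composite Simpson's rule rather than a series expansion, and the names `period_ratio` and `two_term_ratio` are illustrative choices, not part of the problem.

```python
import math


def period_ratio(alpha, n=2000):
    """Exact period divided by the linearised period 2*pi*(b/g)**0.5.

    alpha is the angular amplitude in radians (0 <= alpha < pi) and
    n is the (even) number of Simpson sub-intervals on [0, pi/2].
    """
    if n % 2:
        raise ValueError("n must be even for Simpson's rule")
    eps = math.sin(alpha / 2)

    def integrand(phi):
        return 1.0 / math.sqrt(1.0 - (eps * math.sin(phi)) ** 2)

    h = (math.pi / 2) / n
    total = integrand(0.0) + integrand(math.pi / 2)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(i * h)
    integral = total * h / 3
    # tau / tau_linear = 4 * integral / (2 * pi) = 2 * integral / pi
    return 2 * integral / math.pi


def two_term_ratio(alpha):
    """Two-term series approximation 1 + alpha**2 / 16."""
    return 1 + alpha ** 2 / 16
```

Evaluating `period_ratio(math.pi / 6)` and `period_ratio(math.pi / 3)` should reproduce the increases of about 1.7% and 7.3% quoted for the exact period, and `two_term_ratio(math.pi / 6)` the 1.7% predicted by the two term approximation.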

4.36 Baseball trajectory A baseball is struck with an initial speed of 45 m s$^{-1}$ (just over 100
mph) at an elevation angle of 40°. Find its path and compare this with the corresponding path
when air resistance is neglected. [A baseball has mass 0.30 kg and radius 3.5 cm. Assume the
quadratic law of resistance.]

Show that the equation of motion can be written in the form

$$\frac{d\mathbf{v}}{dt} = -g\mathbf{k} - \frac{g}{V^2}\,|\mathbf{v}|\,\mathbf{v},$$

where V is the terminal speed. Resolve this vector equation into two (coupled) scalar equations for $v_x$ and
$v_z$ and perform a numerical solution. In this example, air resistance reduces the range by about 35%. It
really is easier to hit a home run in Mile High stadium!


Chapter Five


Linear oscillations

and normal modes 


KEY FEATURES 

The key features of this chapter are the properties of free undamped oscillations, free damped 
oscillations, driven oscillations, and coupled oscillations. 


Oscillations are a particularly important part of mechanics and indeed of physics as a
whole. This is because of their widespread occurrence and the practical importance of
oscillation problems. In this chapter we study the classical linear theory of oscillations, which
is important for two reasons: (i) the linear theory usually gives a good approximation to the
motion when the amplitude of the oscillations is small, and (ii) in the linear theory, most
problems can be solved explicitly in closed form. The importance of this last fact should not be
underestimated! We develop the theory in the context of the oscillations of a body attached to a
spring, but the same equations apply to many different problems in mechanics and throughout
physics.

In the course of this chapter we will need to solve linear second order ODEs with constant 
coefficients. For a description of the standard method of solution see Boyce & DiPrima [8]. 


5.1 BODY ON A SPRING 

Suppose a body of mass m is attached to one end of a light spring. The other end of
the spring is attached to a fixed point A on a smooth horizontal table, and the body slides on
the table in a straight line through A. Let x be the displacement and v the velocity of the body
at time t, as shown in Figure 5.1; note that x is measured from the equilibrium position of the
body.

FIGURE 5.1 The body m is attached to one end of a light spring and moves in a
straight line. [Panels: equilibrium position; in motion; forces.]

Consider now the forces acting on the body. When the spring is extended, it exerts a
restoring force S in the opposite direction to the extension. Also, the body may encounter
a resistance force R acting in the opposite direction to its velocity. Finally, there may be an
external driving force G(t) that is a specified function of the time. The equation of motion for
the body is then

$$m\frac{dv}{dt} = -S - R + G(t). \tag{5.1}$$

The restoring force S is determined by the design of the spring and the extension x. For
sufficiently small strains* the relationship between S and x is approximately linear, that is,

$$S = \alpha x, \tag{5.2}$$

where $\alpha$ is a positive constant called the spring constant (or strength) of the spring. A
powerful spring, such as those used in automobile suspensions, has a large value of $\alpha$; the
spring behind a doorbell has a small value of $\alpha$. The formula (5.2) is called Hooke’s law† and
a spring that obeys Hooke’s law exactly is called a linear spring.

The resistance force R depends on the physical process that is causing the resistance.
For fluid resistance, the linear or quadratic resistance laws considered in Chapter 4 may be
appropriate. However, neither of these laws represents the frictional force exerted by a rough
table. In this chapter we assume the law of linear resistance

$$R = \beta v, \tag{5.3}$$

where $\beta$ is a positive constant called the resistance constant; it is a measure of the strength
of the resistance. There is no point in disguising the fact that our major reason for assuming
linear resistance is that (together with Hooke’s law) it leads to a linear equation of motion that
can be solved explicitly. However, it does give insight into the general effect of all resistances,
and actually is appropriate when the resistance arises from slow viscous flow (automobile
shock absorbers, for instance); it is also appropriate in the electric circuit analogue, where it is
equivalent to Ohm’s law.

With Hooke’s law and linear resistance, the equation of motion (5.1) for the body
becomes

$$m\frac{d^2x}{dt^2} + \beta\frac{dx}{dt} + \alpha x = G(t),$$

where $\alpha$ is the spring constant, $\beta$ is the resistance constant and G(t) is the prescribed driving
force. This is a second order, linear ODE with constant coefficients for the unknown displacement
x(t). We could go ahead with the solution of this equation as it stands, but the algebra is
made much easier by introducing two new constants $\Omega$ and K (instead of $\alpha$ and $\beta$) defined by
the relations

$$\alpha = m\Omega^2, \qquad \beta = 2mK.$$


* The strain is the extension of the spring divided by its natural length. If the strain is large, then the linear
approximation will break down and a non-linear approximation, such as $S = ax + bx^2$, must be used
instead.

† After Robert Hooke (1635-1703). Hooke was an excellent scientist, full of ideas and a first class
experimenter, but he lacked the mathematical skills to develop his ideas. When other scientists (Newton in
particular) did so, he accused them of stealing his work and this led to a succession of bitter disputes. So
that his rivals could not immediately make use of his discovery, Hooke first published the law that bears
his name as an anagram on the Latin phrase ‘ut tensio, sic vis’ (as the extension, so the force).

The equation of motion for the body then becomes

$$\frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = F(t), \tag{5.4}$$

where $F(t) = G(t)/m$, the driving force per unit mass. This is the standard form of the
equation of motion for the body. Any system that leads to an equation of this form is called
a damped* linear oscillator. When the force F(t) is absent, the oscillations are said to be
free; when it is present, the oscillations are said to be driven.


5.2 CLASSICAL SIMPLE HARMONIC MOTION 


A linear oscillator that is both undamped and undriven is called a classical linear
oscillator. This is the simplest case, but arguably the most important system in physics! The
equation (5.4) reduces to

$$\frac{d^2x}{dt^2} + \Omega^2 x = 0, \tag{5.5}$$

which, because of the solutions we are about to obtain, is called the SHM equation.


Solution procedure 

Seek solutions of the form $x = e^{\lambda t}$. Then $\lambda$ must satisfy the equation

$$\lambda^2 + \Omega^2 = 0,$$

which gives $\lambda = \pm i\Omega$. We have thus found the pair of complex solutions

$$x = e^{\pm i\Omega t},$$

which form a basis for the space of complex solutions. The real and imaginary parts of the
first complex solution are

$$x = \begin{cases} \cos\Omega t \\ \sin\Omega t \end{cases}$$

* Damping is another term for resistance. Indeed, automobile shock absorbers are sometimes called 
dampers. 





FIGURE 5.2 Classical simple harmonic motion
$x = C\cos(\Omega t - \gamma)$.

and these functions form a basis for the space of real solutions. The general real solution of
the SHM equation is therefore

$$x = A\cos\Omega t + B\sin\Omega t, \tag{5.6}$$

where A and B are real arbitrary constants. This general solution can be written in the
alternative form*

$$x = C\cos(\Omega t - \gamma), \tag{5.7}$$

where C and $\gamma$ are real arbitrary constants with C > 0.
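
The passage between the two forms of the general solution can be made concrete. Matching $C\cos(\Omega t - \gamma)$ with $A\cos\Omega t + B\sin\Omega t$ requires $A = C\cos\gamma$ and $B = C\sin\gamma$. The sketch below (the function name is an illustrative choice) uses the two-argument arctangent so that $\gamma$ falls in the correct quadrant, something that $\tan\gamma = B/A$ alone does not settle.

```python
import math


def amplitude_phase(A, B):
    """Return (C, gamma) with A*cos(s) + B*sin(s) == C*cos(s - gamma).

    C is non-negative and gamma lies in (-pi, pi].
    """
    C = math.hypot(A, B)       # C = (A**2 + B**2)**0.5
    gamma = math.atan2(B, A)   # solves A = C cos(gamma), B = C sin(gamma)
    return C, gamma
```

For example, `amplitude_phase(3.0, 4.0)` gives an amplitude of 5.0 together with the phase whose cosine is 3/5 and whose sine is 4/5.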


General form of the motion 

The general form of the motion is most easily deduced from the form (5.7) and is shown in
Figure 5.2. This is called simple harmonic motion (SHM). The body makes infinitely many
oscillations of constant amplitude C; the constant $\gamma$ is simply a ‘phase factor’ which shifts
the whole graph by $\gamma/\Omega$ in the t-direction. Since the cosine function repeats itself when the
argument $\Omega t$ increases by $2\pi$, it follows that the period of the oscillations is given by

$$\tau = \frac{2\pi}{\Omega}. \tag{5.8}$$

The quantity $\Omega$, which is related to the frequency $\nu$ by $\Omega = 2\pi\nu$, is called the angular
frequency of the oscillations.


Example 5.1 An initial value problem for classical SHM 

A body of mass m is suspended from a fixed point by a light spring and can move
under uniform gravity. In equilibrium, the spring is found to be extended by a distance
b. Find the period of vertical oscillations of the body about this equilibrium position.
[Assume small strains.]

The body is hanging in its equilibrium position when it receives a sudden blow
which projects it upwards with speed u. Find the subsequent motion.


* This transformation is based on the result from trigonometry that $a\cos\theta + b\sin\theta$ can always be written
in the form $c\cos(\theta - \gamma)$, where $c = (a^2 + b^2)^{1/2}$ and $\tan\gamma = b/a$.

Solution 

When the spring is subjected to a constant force of magnitude $mg$, the extension is $b$.
Hence $\alpha$, the strength of the spring, is given by $\alpha = mg/b$.

Let $z$ be the downwards displacement of the body from its equilibrium position.
Then the extension of the spring is $b + z$ and the restoring force is $\alpha(b + z) =
mg(b + z)/b$. The equation of motion for the body is therefore

\[ m\frac{d^2z}{dt^2} = mg - \frac{mg(b + z)}{b}, \]

that is

\[ \frac{d^2z}{dt^2} + \frac{g}{b}\,z = 0. \]

This is the SHM equation with $\Omega^2 = g/b$. It follows that the period $\tau$ of vertical
oscillations about the equilibrium position is given by

\[ \tau = \frac{2\pi}{\Omega} = 2\pi\left(\frac{b}{g}\right)^{1/2}. \]

In the initial value problem, the subsequent motion must have the form

\[ z = A\cos\Omega t + B\sin\Omega t, \]

where $\Omega = (g/b)^{1/2}$. The initial condition $z = 0$ when $t = 0$ shows that $A = 0$ and
the initial condition $\dot{z} = -u$ when $t = 0$ then gives $\Omega B = -u$, that is, $B = -u/\Omega$.
The subsequent motion is therefore

\[ z = -\frac{u}{\Omega}\sin\Omega t, \]

where $\Omega = (g/b)^{1/2}$. ■
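The result of Example 5.1 is easy to evaluate numerically. The following is a minimal sketch, assuming SI units and a default value $g = 9.81$; the function names `spring_period` and `displacement_after_blow` are illustrative and do not come from the text.

```python
import math

def spring_period(b, g=9.81):
    """Period tau = 2*pi*sqrt(b/g) of vertical oscillations, where b is the
    static extension of the spring under the weight of the body."""
    omega = math.sqrt(g / b)          # angular frequency, Omega^2 = g/b
    return 2.0 * math.pi / omega

def displacement_after_blow(t, u, b, g=9.81):
    """Downward displacement z(t) = -(u/Omega) sin(Omega t) after the body is
    projected upwards with speed u from its equilibrium position."""
    omega = math.sqrt(g / b)
    return -(u / omega) * math.sin(omega * t)

if __name__ == "__main__":
    # A static extension of 10 cm gives a period of roughly 0.63 s.
    print(spring_period(0.10))
```

Note that the mass of the body does not appear: only the static extension $b$ is needed.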


5.3 DAMPED SIMPLE HARMONIC MOTION 


When damping is present but there is no external force, the general equation (5.4)
reduces to

\[ \frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = 0, \tag{5.9} \]

the damped SHM equation.

The solution procedure is the same as in the last section. Seek solutions of the form
$x = e^{\lambda t}$. Then $\lambda$ must satisfy the equation

\[ \lambda^2 + 2K\lambda + \Omega^2 = 0, \]

that is

\[ (\lambda + K)^2 = K^2 - \Omega^2. \]

We see that different cases arise depending on whether $K < \Omega$, $K = \Omega$ or $K > \Omega$. These
cases give rise to different kinds of solution and must be treated separately.


Under-damping (sub-critical damping): $K < \Omega$

In this case, we write the equation for $\lambda$ in the form

\[ (\lambda + K)^2 = -\Omega_D^2, \]

where $\Omega_D = (\Omega^2 - K^2)^{1/2}$, a positive real number. The $\lambda$ values are then $\lambda = -K \pm i\Omega_D$.
We have thus found the pair of complex solutions

\[ x = e^{-Kt} e^{\pm i\Omega_D t}, \]

which form a basis for the space of complex solutions. The real and imaginary parts of the
first complex solution are

\[ x = e^{-Kt}\cos\Omega_D t, \qquad x = e^{-Kt}\sin\Omega_D t, \]

and these functions form a basis for the space of real solutions. The general real solution of
the damped SHM equation in this case is therefore

\[ x = e^{-Kt}\left(A\cos\Omega_D t + B\sin\Omega_D t\right), \tag{5.10} \]

where $A$ and $B$ are real arbitrary constants. This general solution can be written in the
alternative form

\[ x = Ce^{-Kt}\cos(\Omega_D t - \gamma), \tag{5.11} \]

where $C$ and $\gamma$ are real arbitrary constants with $C > 0$.

General form of the motion 

The general form of the motion is most easily deduced from the form (5.11) and is shown
in Figure 5.3. This is called under-damped SHM. The body still executes infinitely many
oscillations, but now they have exponentially decaying amplitude $Ce^{-Kt}$. Suppose the period
$\tau$ of the oscillations is defined as shown in Figure 5.3.* The introduction of damping decreases
the angular frequency of the oscillations from $\Omega$ to $\Omega_D$, which increases the period of the
oscillations from $2\pi/\Omega$ to

\[ \tau = \frac{2\pi}{\Omega_D} = \frac{2\pi}{(\Omega^2 - K^2)^{1/2}}. \tag{5.12} \]

* The period might also be defined as the time interval between successive maxima of the function $x(t)$.
Since these maxima do not occur at the points at which $x(t)$ touches the bounding curves, it is not obvious
that this time interval is even a constant. However, it is a constant and has the same value as (5.12) (see
Problem 5.5).

FIGURE 5.3 Under-damped simple harmonic motion
$x = Ce^{-Kt}\cos(\Omega_D t - \gamma)$.


Over-damping (super-critical damping): $K > \Omega$

In this case, we write the equation for $\lambda$ in the form

\[ (\lambda + K)^2 = \delta^2, \]

where $\delta = (K^2 - \Omega^2)^{1/2}$, a positive real number. The $\lambda$ values are then $\lambda = -K \pm \delta$, which
are now real. We have thus found the pair of real solutions

\[ x = e^{-Kt} e^{\pm\delta t}, \]

which form a basis for the space of real solutions. The general real solution of the damped
SHM equation in this case is therefore

\[ x = e^{-Kt}\left(Ae^{\delta t} + Be^{-\delta t}\right), \tag{5.13} \]

where $A$ and $B$ are real arbitrary constants.

General form of the motion 

Three typical forms for the motion are shown in Figure 5.4. This is called over-damped SHM.
Somewhat surprisingly, the body does not oscillate at all. For example, if the body is released
from rest, then it simply drifts back towards the equilibrium position. On the other hand, if
the body is projected towards the equilibrium position with sufficient speed, then it passes the
equilibrium position once and then drifts back towards it from the other side.


Critical damping: $K = \Omega$

The case of critical damping is solved in Problem 5.6. Qualitatively, the motions look like
those in Figure 5.4.

FIGURE 5.4 Three typical cases of over-damped simple
harmonic motion.
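The three damping cases can be collected into one routine that fits the arbitrary constants to given initial values. This is a sketch under stated assumptions: the function name `damped_free_motion` is illustrative, the constants $A$, $B$ are obtained by applying $x(0) = x_0$, $\dot{x}(0) = v_0$ to the forms (5.10) and (5.13), and the critically damped branch uses the standard form $x = e^{-Kt}(A + Bt)$, which the text defers to Problem 5.6.

```python
import math

def damped_free_motion(t, K, Omega, x0, v0):
    """x(t) for x'' + 2K x' + Omega^2 x = 0 with x(0) = x0 and x'(0) = v0."""
    if K < Omega:
        # Under-damping, form (5.10): x = e^{-Kt}(A cos(Wd t) + B sin(Wd t))
        wd = math.sqrt(Omega**2 - K**2)
        A = x0
        B = (v0 + K * x0) / wd
        return math.exp(-K * t) * (A * math.cos(wd * t) + B * math.sin(wd * t))
    if K > Omega:
        # Over-damping, form (5.13): x = e^{-Kt}(A e^{dt} + B e^{-dt})
        d = math.sqrt(K**2 - Omega**2)
        A = ((K + d) * x0 + v0) / (2.0 * d)
        B = x0 - A
        return math.exp(-K * t) * (A * math.exp(d * t) + B * math.exp(-d * t))
    # Critical damping (assumed standard form): x = e^{-Kt}(A + B t)
    return math.exp(-K * t) * (x0 + (v0 + K * x0) * t)

if __name__ == "__main__":
    # Over-damped oscillator released from rest at x = 1: no oscillation occurs.
    for t in (0.0, 0.5, 1.0, 2.0):
        print(t, damped_free_motion(t, 5.0, 4.0, 1.0, 0.0))
```

The exact comparison `K == Omega` implied by the last branch is fragile in floating point; in practice the critical case is a limit rather than something one hits exactly.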


5.4 DRIVEN (FORCED) MOTION 


We now include the effect of an external driving force $G(t)$ which we suppose to be
a given function of the time. In the case of a body suspended by a spring, we could apply such
a force directly, but, in practice, the external ‘force’ often arises indirectly by virtue of the
suspension point being made to oscillate in some prescribed way. The seismograph described
in the next section is an instance of this.

Whatever the origin of the driving force, the governing equation for driven motion is
(5.4), namely

\[ \frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = F(t), \tag{5.14} \]

where $2mK$ is the damping constant, $m\Omega^2$ is the spring constant and $mF(t)$ is the driving
force. Since this equation is linear and inhomogeneous, its general solution is the sum of (i)
the general solution of the corresponding homogeneous equation (5.9) (the complementary
function) and (ii) any particular solution of the inhomogeneous equation (5.14) (the particular
integral). The complementary function has already been found in the last section, and
it remains to find the particular integral for interesting choices of $F(t)$. Actually there is a
(rather complicated) formula for a particular integral of this equation for any choice of the
driving force $mF(t)$. However, the most important case by far is that of time harmonic forcing
and, in this case, it is easier to find a particular integral directly. Time harmonic forcing is
the case in which

\[ F(t) = F_0\cos pt, \tag{5.15} \]


where $F_0$ and $p$ are positive constants; $mF_0$ is the amplitude of the applied force and $p$ is its
angular frequency.

Solution procedure

We first replace the forcing term $F_0\cos pt$ by its complex counterpart $F_0 e^{ipt}$. This gives the
complex equation

\[ \frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = F_0 e^{ipt}. \tag{5.16} \]

We then seek a particular integral of this complex equation in the form

\[ x = ce^{ipt}, \tag{5.17} \]

where $c$ is a complex constant called the complex amplitude. On substituting (5.17) into
equation (5.16) we find that

\[ c = \frac{F_0}{\Omega^2 - p^2 + 2iKp}, \tag{5.18} \]

so that the complex function

\[ \frac{F_0 e^{ipt}}{\Omega^2 - p^2 + 2iKp} \tag{5.19} \]

is a particular integral of the complex equation (5.16). A particular integral of the real
equation (5.14) is then given by the real part of the complex expression (5.19). It follows that a
particular integral of equation (5.14) is given by

\[ x^D = a\cos(pt - \gamma), \]

where $a = |c|$ and $\gamma = -\arg c$. This particular integral, which is also time harmonic with the
same frequency as the applied force, is called the driven response of the oscillator to the force
$mF_0\cos pt$; $a$ is the amplitude of the driven response and $\gamma$ ($0 < \gamma < \pi$) is the phase angle
by which the response lags behind the force. From the expression (5.18) for $c$, it follows that

\[ a = \frac{F_0}{\left((\Omega^2 - p^2)^2 + 4K^2p^2\right)^{1/2}}, \qquad \tan\gamma = \frac{2Kp}{\Omega^2 - p^2}. \tag{5.20} \]
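The complex amplitude (5.18) and the amplitude and phase lag (5.20) can be evaluated directly with complex arithmetic. The following is a minimal sketch; the function name `driven_response` and the sample parameter values are illustrative assumptions, not taken from the text.

```python
import cmath
import math

def driven_response(F0, p, K, Omega):
    """Return (c, a, gamma) for x'' + 2K x' + Omega^2 x = F0 cos(pt).

    c     : complex amplitude, equation (5.18)
    a     : amplitude |c| of the driven response
    gamma : phase lag -arg(c), which lies in (0, pi) when K > 0 and p > 0
    """
    c = F0 / (Omega**2 - p**2 + 2j * K * p)
    a = abs(c)
    gamma = -cmath.phase(c)
    return c, a, gamma

if __name__ == "__main__":
    c, a, gamma = driven_response(F0=1.0, p=2.0, K=0.5, Omega=3.0)
    print(c, a, math.degrees(gamma))
```

Taking $\gamma = -\arg c$ rather than inverting $\tan\gamma$ avoids having to choose the correct branch of the inverse tangent when $p > \Omega$.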


The general solution of equation (5.14) therefore has the form

\[ x = a\cos(pt - \gamma) + x^{CF}, \tag{5.21} \]

where $x^{CF}$ is the complementary function, that is, the general solution of the corresponding
undriven problem.

The undriven problem has already been solved in the last section. The solution took three
different forms depending on whether the damping was supercritical, critical or subcritical.
However, all these forms have one feature in common, that is, they all decay to zero with
increasing time. For this reason, the complementary function for this equation is often called
the transient response of the oscillator. Any solution of equation (5.14) is therefore the sum
of the driven response $x^D$ (which persists) and a transient response $x^{CF}$ (which dies away).
Thus, no matter what the initial conditions, after a sufficiently long time we are left with just
the driven response. In many problems, the transient response can be disregarded, but it must
be included if initial conditions are to be satisfied.

Example 5.2 An initial value problem for driven motion

The equation of motion of a certain driven damped oscillator is

\[ \frac{d^2x}{dt^2} + 3\frac{dx}{dt} + 2x = 10\cos t \]

and initially the particle is at rest at the origin. Find the subsequent motion.

Solution

First we find the driven response $x^D$. The complex counterpart of the equation of
motion is

\[ \frac{d^2x}{dt^2} + 3\frac{dx}{dt} + 2x = 10e^{it} \]

and we seek a solution of this equation of the form $x = ce^{it}$. On substituting in, we
find that

\[ c = \frac{10}{1 + 3i} = 1 - 3i. \]

It follows that the driven response $x^D$ is given by

\[ x^D = \Re\left[(1 - 3i)e^{it}\right] = \cos t + 3\sin t. \]

Now for the complementary function $x^{CF}$. This is the general solution of the
corresponding undriven equation

\[ \frac{d^2x}{dt^2} + 3\frac{dx}{dt} + 2x = 0, \]

which is easily found to be

\[ x = Ae^{-t} + Be^{-2t}, \]

where $A$ and $B$ are arbitrary constants. The general solution of the equation of
motion is therefore

\[ x = \cos t + 3\sin t + Ae^{-t} + Be^{-2t}. \]

It now remains to choose $A$ and $B$ so that the initial conditions are satisfied. The
condition $x = 0$ when $t = 0$ implies that

\[ 0 = 1 + A + B, \]

and the condition $\dot{x} = 0$ when $t = 0$ implies that

\[ 0 = 3 - A - 2B. \]

Solving these simultaneous equations gives $A = -5$ and $B = 4$. The subsequent
motion of the oscillator is therefore given by

\[ x = \underbrace{\cos t + 3\sin t}_{\text{driven response}} \; \underbrace{{} - 5e^{-t} + 4e^{-2t}}_{\text{transient response}}. \]

This solution is shown in Figure 5.5 together with the driven response only. In this
case, the transient response is insignificant after less than one cycle of the driving
force. The amplitude of the driven response is $(1^2 + 3^2)^{1/2} = \sqrt{10}$ and the phase lag
is $\tan^{-1}(3/1) \approx 72^\circ$. ■

FIGURE 5.5 The solid curve is the actual
response and the dashed curve the driven
response only.
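As a check on Example 5.2, the solution and its first two derivatives can be written out by hand and substituted back into the equation of motion. This is only a verification sketch; the function names (`x_ex52`, `residual_ex52`, and so on) are illustrative.

```python
import math

def x_ex52(t):
    """Solution of Example 5.2: driven response plus transient response."""
    return math.cos(t) + 3 * math.sin(t) - 5 * math.exp(-t) + 4 * math.exp(-2 * t)

def xdot_ex52(t):
    """First derivative of x_ex52, differentiated by hand."""
    return -math.sin(t) + 3 * math.cos(t) + 5 * math.exp(-t) - 8 * math.exp(-2 * t)

def xddot_ex52(t):
    """Second derivative of x_ex52, differentiated by hand."""
    return -math.cos(t) - 3 * math.sin(t) - 5 * math.exp(-t) + 16 * math.exp(-2 * t)

def residual_ex52(t):
    """Left side minus right side of x'' + 3x' + 2x = 10 cos t."""
    return xddot_ex52(t) + 3 * xdot_ex52(t) + 2 * x_ex52(t) - 10 * math.cos(t)

if __name__ == "__main__":
    print(x_ex52(0.0), xdot_ex52(0.0))            # both initial conditions
    print(max(abs(residual_ex52(0.1 * k)) for k in range(100)))
```

Both initial values vanish and the residual is zero up to rounding error, which confirms the constants $A = -5$ and $B = 4$.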

Resonance of an oscillating system 

Consider the general formula

\[ a = \frac{F_0}{\left((\Omega^2 - p^2)^2 + 4K^2p^2\right)^{1/2}} \]

for the amplitude $a$ of the driven response to the force $mF_0\cos pt$ (see equation (5.20)).
Suppose that the amplitude of the applied force, the spring constant, and the resistance constant
are held fixed and that the angular frequency $p$ of the applied force is varied. Then $a$ is a
function of $p$ only. Which value of $p$ produces the largest driven response? Let

\[ f(q) = (\Omega^2 - q)^2 + 4K^2 q. \]

Then, since $a = F_0/\sqrt{f(p^2)}$, we need only find the minimum point of the function $f(q)$ lying
in $q > 0$. Now

\[ f'(q) = -2(\Omega^2 - q) + 4K^2 = 2\left[q - (\Omega^2 - 2K^2)\right] \]

so that $f(q)$ decreases for $q < \Omega^2 - 2K^2$ and increases for $q > \Omega^2 - 2K^2$. Hence $f(q)$ has
a unique minimum point at $q = \Omega^2 - 2K^2$. Two cases arise depending on whether this value
is positive or not.

Case 1. When $\Omega^2 > 2K^2$, the minimum point $q = \Omega^2 - 2K^2$ is positive and $a$ has its
maximum value when $p = p_R$, where

\[ p_R = (\Omega^2 - 2K^2)^{1/2}. \]

The angular frequency $p_R$ is called the resonant frequency of the oscillator. The
value of $a$ at the resonant frequency is

\[ a_{\max} = \frac{F_0}{2K(\Omega^2 - K^2)^{1/2}}. \]

Case 2. When $\Omega^2 \le 2K^2$, $a$ is a decreasing function of $p$ for $p > 0$ so that $a$ has no maximum
point.

FIGURE 5.6 The dimensionless amplitude $(\Omega^2/F_0)\,a$
against the dimensionless driving frequency $p/\Omega$ for
(from the top) $K/\Omega = 0.1, 0.2, 0.3, 1$.
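The two cases translate directly into a small routine that returns the resonant frequency and peak amplitude when they exist. This is a sketch with illustrative names (`driven_amplitude`, `resonance`); returning `None` in Case 2 is a design choice made here, not something prescribed by the text.

```python
import math

def driven_amplitude(p, F0, K, Omega):
    """Amplitude a(p) of the driven response, from equation (5.20)."""
    return F0 / math.sqrt((Omega**2 - p**2) ** 2 + 4 * K**2 * p**2)

def resonance(F0, K, Omega):
    """Return (p_R, a_max) when Omega^2 > 2K^2 (Case 1), otherwise None (Case 2)."""
    if Omega**2 <= 2 * K**2:
        return None                      # a(p) decreases for all p > 0
    p_R = math.sqrt(Omega**2 - 2 * K**2)
    a_max = F0 / (2 * K * math.sqrt(Omega**2 - K**2))
    return p_R, a_max

if __name__ == "__main__":
    print(resonance(1.0, 0.1, 1.0))      # lightly damped: a sharp peak just below Omega
    print(resonance(1.0, 1.0, 1.0))      # heavily damped: no resonance peak
```

Evaluating `driven_amplitude` at `p_R` reproduces `a_max`, which is a useful consistency check between the closed-form peak height and the general formula.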

These results are illustrated in Figure 5.6. They are an example of the general physical 
phenomenon known as resonance, which can be loosely stated as follows: 


The phenomenon of resonance 

Suppose that, in the absence of damping, a physical system can perform free oscillations
with angular frequency $\Omega$. Then a driving force with angular frequency $p$ will induce a
large response in the system when $p$ is close to $\Omega$, provided that the damping is not too
large.


This principle does not just apply to the mechanical systems we study here. It is a general 
physical principle that also applies, for example, to the oscillations of electric currents in 
circuits and to the quantum mechanical oscillations of atoms. 

Note that the resonant frequency $p_R$ is always less than $\Omega$, but is close to $\Omega$ when $K/\Omega$
is small. The height of the resonance peak, $a_{\max}$, is given approximately by

\[ a_{\max} \sim \frac{F_0}{2\Omega^2}\left(\frac{\Omega}{K}\right) \]

in the limit in which $K/\Omega$ is small; $a_{\max}$ therefore tends to infinity in this limit. In the same
limit, the width of the resonance peak is directly proportional to $K/\Omega$ and consequently tends
to zero.

General periodic driving force

The method we have developed for the time harmonic driving force can be extended to any
periodic driving force $mF(t)$. A function $f(t)$ is said to be periodic with period $\tau$ if the values
taken by $f$ in any interval of length $\tau$ are then repeated in the next interval of length $\tau$. An
example is the ‘square wave’ function shown in Figure 5.7. The solution method requires that
$F(t)$ be expanded as a Fourier series.* A textbook on mechanics is not the place to develop
the theory of Fourier series. Instead we will simply quote the essential results and then give an
example of how the method works. To keep the algebra as simple as possible, we will suppose
that the driving force has period $2\pi$.†


Fourier’s Theorem 

Fourier’s theorem states that any function $f(t)$ that is periodic with period $2\pi$ can be
expanded as a Fourier series in the form

\[ f(t) = \tfrac{1}{2}a_0 + \sum_{n=1}^{\infty}\left(a_n\cos nt + b_n\sin nt\right), \tag{5.22} \]

where the Fourier coefficients $\{a_n\}$ and $\{b_n\}$ are given by the formulae

\[ a_n = \frac{1}{\pi}\int_{-\pi}^{\pi} f(t)\cos nt\,dt, \qquad b_n = \frac{1}{\pi}\int_{-\pi}^{\pi} f(t)\sin nt\,dt. \tag{5.23} \]


What this means is that any function $f(t)$ with period $2\pi$ can be expressed as a sum of
time harmonic terms, each of which has period $2\pi$. In order to find the driven response of
the oscillator when the force $mF(t)$ is applied, we first expand $F(t)$ in a Fourier series. We
then find the driven response that would be induced by each of the terms of this Fourier series
applied separately, and then simply add these responses together. The method depends on the
equation of motion being linear.


Example 5.3 Periodic non-harmonic driving force 


Find the driven response of the damped linear oscillator

\[ \frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = F(t) \]

for the case in which $F(t)$ is periodic with period $2\pi$ and takes the values

\[ F(t) = \begin{cases} F_0 & (0 < t < \pi), \\ -F_0 & (\pi < t < 2\pi), \end{cases} \]

in the interval $0 < t < 2\pi$. This function* is shown in Figure 5.7.


* After Jean Baptiste Joseph Fourier 1768-1830. The memoir in which he developed the theory of
trigonometric series, ‘On the Propagation of Heat in Solid Bodies’, was submitted for the mathematics prize of
the Paris Institute in 1811; the judges included such luminaries as Lagrange, Laplace and Legendre. They
awarded Fourier the prize but griped about his lack of mathematical rigour.

† The general case can be reduced to this one by a scaling of the unit of time.

FIGURE 5.7 The ‘square wave’ input function $F(t)$ is periodic with
period $2\pi$. Its value alternates between $\pm F_0$.


Solution 

The first step is to find the Fourier series of the function $F(t)$. From the formula
(5.23), the coefficient $a_n$ is given by

\[ a_n = \frac{1}{\pi}\int_{-\pi}^{\pi} F(t)\cos nt\,dt = \frac{1}{\pi}\int_{-\pi}^{0}(-F_0)\cos nt\,dt + \frac{1}{\pi}\int_{0}^{\pi}(+F_0)\cos nt\,dt = 0, \]

since both integrals are zero for $n \ge 1$ and are equal and opposite when $n = 0$. In the
same way,

\[ b_n = \frac{1}{\pi}\int_{-\pi}^{\pi} F(t)\sin nt\,dt = \frac{1}{\pi}\int_{-\pi}^{0}(-F_0)\sin nt\,dt + \frac{1}{\pi}\int_{0}^{\pi}(+F_0)\sin nt\,dt = \frac{2F_0}{\pi}\int_{0}^{\pi}\sin nt\,dt, \]

since this time the two integrals are equal. Hence

\[ b_n = \frac{2F_0}{\pi}\left[-\frac{\cos nt}{n}\right]_0^{\pi} = \frac{2F_0}{\pi}\left(\frac{1 - \cos n\pi}{n}\right) = \frac{2F_0}{\pi}\left(\frac{1 - (-1)^n}{n}\right). \]

Hence the Fourier series of the function $F(t)$ is

\[ F(t) = \frac{2F_0}{\pi}\sum_{n=1}^{\infty}\left(\frac{1 - (-1)^n}{n}\right)\sin nt. \]


* This function is the mechanical equivalent of a ‘square wave input’ in electric circuit theory.

FIGURE 5.8 Driven response of a damped oscillator to the alternating constant force $\pm mF_0$ with
angular frequency $p$: Left $\Omega/p = 1.5$, $K/p = 1$. Right $\Omega/p = 2.5$, $K/p = 0.1$. The light
graphs show the first term of the expansion series.


The next step is to find the driven response of the oscillator to the force
$m(b_n\sin nt)$, that is, the particular integral of the equation

\[ \frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = b_n\sin nt. \tag{5.24} \]

The complex counterpart of this equation is

\[ \frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = b_n e^{int}, \]

for which the particular integral is $ce^{int}$, where the complex amplitude $c$ is given by

\[ c = \frac{b_n}{\Omega^2 - n^2 + 2iKn}. \]

Since $\sin nt$ is the imaginary part of $e^{int}$, the particular integral of the real equation (5.24)
is then given by

\[ \Im\left(\frac{b_n e^{int}}{\Omega^2 - n^2 + 2iKn}\right) = b_n\left(\frac{(\Omega^2 - n^2)\sin nt - 2Kn\cos nt}{(\Omega^2 - n^2)^2 + 4K^2n^2}\right). \]

Finally we add together these separate responses to find the driven response of
the oscillator to the force $mF(t)$. On inserting the value of the coefficient $b_n$, this
gives

\[ x = \frac{2F_0}{\pi}\sum_{n=1}^{\infty}\left(\frac{1 - (-1)^n}{n}\right)\left(\frac{(\Omega^2 - n^2)\sin nt - 2Kn\cos nt}{(\Omega^2 - n^2)^2 + 4K^2n^2}\right). \tag{5.25} \]
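The series (5.25) is straightforward to sum numerically. The sketch below assumes the unit of time has been scaled so that the forcing has period $2\pi$, as in the example; the function names `harmonic_response` and `square_wave_response` and the truncation parameter `n_max` are illustrative choices.

```python
import math

def harmonic_response(t, n, b_n, K, Omega):
    """Particular integral of x'' + 2K x' + Omega^2 x = b_n sin(nt)."""
    D = Omega**2 - n**2
    E = 2.0 * K * n
    return b_n * (D * math.sin(n * t) - E * math.cos(n * t)) / (D * D + E * E)

def square_wave_response(t, F0, K, Omega, n_max=199):
    """Partial sum of the series (5.25) up to harmonic n_max.

    Only odd n contribute, because 1 - (-1)^n vanishes for even n; for odd n
    the Fourier coefficient is b_n = 4 F0 / (pi n).
    """
    total = 0.0
    for n in range(1, n_max + 1, 2):
        b_n = 4.0 * F0 / (math.pi * n)
        total += harmonic_response(t, n, b_n, K, Omega)
    return total

if __name__ == "__main__":
    # Compare the first term alone with a longer partial sum.
    t = 1.0
    print(square_wave_response(t, 1.0, 1.0, 1.5, n_max=1))
    print(square_wave_response(t, 1.0, 1.0, 1.5, n_max=199))
```

The terms decay like $1/n^3$ for large $n$, so a few hundred harmonics are far more than is needed for plotting accuracy.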


In order to deduce anything from this complicated formula, we must either sum
the series numerically or approximate the formula in some way. When $\Omega$ and $K$ are
both small compared to the forcing frequency $p$, the series (5.25) converges quite
quickly and can be approximated (to within a few percent) by the first term. Even
when $\Omega/p = 1.5$ and $K/p = 1$, this is still a reasonable approximation (see Figure
5.8 (left)). However, for larger values of $\Omega/p$, the higher harmonics in the Fourier
expansion of $F(t)$ that have frequencies close to $\Omega$ produce large contributions (see
Figure 5.8 (right)). In this case, the series (5.25) must be summed numerically. ■

FIGURE 5.9 A simple seismograph for measuring vertical ground motion.


5.5 A SIMPLE SEISMOGRAPH 

The seismograph is an instrument that measures the motion of the ground on which it
stands. In real earthquakes, the ground motion will generally have both vertical and horizontal
components, but, for simplicity, we describe here a device for measuring vertical motion only.

Our simple seismograph (see Figure 5.9) consists of a mass which is suspended from a
rigid support by a spring; the motion of the mass relative to the support is resisted by a damper.
The support is attached to the ground so that the suspension point has the same motion as
the ground below it. This motion sets the suspended mass moving and the resulting spring
extension is measured as a function of the time. Can we deduce what the ground motion was?

Suppose the ground (and therefore the support) has downward displacement $X(t)$ at time
$t$ and that the extension $x(t)$ of the spring is measured from its equilibrium length. Then the
displacement of the mass is $x + X$, relative to an inertial frame. The equation of motion (5.9)
is therefore modified to become

\[ m\frac{d^2(x + X)}{dt^2} = -(2mK)\frac{dx}{dt} - (m\Omega^2)x, \]

that is,

\[ \frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = -\frac{d^2X}{dt^2}. \]

This means that the motion of the body relative to the moving support is the same as if the
support were fixed and the external driving force $-m(d^2X/dt^2)$ were applied to the body.

First consider the driven response of our seismograph to a train of harmonic waves with
amplitude $A$ and angular frequency $p$, that is,

\[ X = A\cos pt. \]

The equation of motion for the spring extension $x$ is then

\[ \frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = Ap^2\cos pt. \]

The complex amplitude of the driven motion is

\[ c = \frac{Ap^2}{-p^2 + 2iKp + \Omega^2} \]

and the real driven motion is

\[ x = a\cos(pt - \gamma), \]

where

\[ a = |c| = \frac{A}{\left|-1 + 2i(K/p) + (\Omega/p)^2\right|}. \]

Thus, provided that the spring and resistance constants are accurately known, the angular
frequency $p$ and amplitude $A$ of the incident wave train can be deduced.

FIGURE 5.10 Two particles are connected between three springs
and perform longitudinal oscillations.

In practice, things may not be so simple. In particular, the incident wave train may be a
mixture of harmonic waves with different amplitudes and frequencies, and these are not easily
disentangled. However, if $K$ and $\Omega$ are chosen so that $K/p$ and $\Omega/p$ are small compared with
unity (for all likely values of $p$), then $c = -A$ and $X = -x$ approximately. Thus, in this
case, the record for $x(t)$ is simply the negative of the ground motion $X(t)$.* Since this result
is independent of the incident frequency, it should also apply to complicated inputs such as a
pulse of waves.
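The claim that $c \approx -A$ when $K/p$ and $\Omega/p$ are small can be checked by evaluating the ratio $c/A$ of the complex amplitude to the ground amplitude. This is a minimal sketch; the function name `seismograph_ratio` and the sample values are illustrative assumptions.

```python
def seismograph_ratio(p, K, Omega):
    """Ratio c/A of the complex amplitude of the spring extension to the
    amplitude of the ground motion X = A cos(pt)."""
    return 1.0 / (-1.0 + 2j * (K / p) + (Omega / p) ** 2)

if __name__ == "__main__":
    # Soft spring and light damping compared with the incident frequency:
    # the ratio is close to -1, so the record x(t) is close to -X(t).
    print(seismograph_ratio(p=100.0, K=1.0, Omega=1.0))
    # Heavier damping at the same frequency distorts amplitude and phase.
    print(seismograph_ratio(p=1.0, K=1.0, Omega=1.0))
```

How close the first ratio is to $-1$ gives a rough measure of the error made in reading the record as $-X(t)$.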


5.6 COUPLED OSCILLATIONS AND NORMAL MODES 

Interesting new effects occur when two or more oscillators are coupled together.
Figure 5.10 shows a typical case in which two bodies are connected between three springs and
the motion takes place in a straight line. We restrict ourselves here to the classical theory in
which the restoring forces are linear and damping is absent. If the springs are non-linear, then
the displacements of the particles must be small enough so that the linear approximation is
adequate.

* What is actually happening is that the mass is hardly moving at all (relative to an inertial frame).

Let $x$ and $y$ be the displacements of the two bodies from their respective equilibrium
positions at time $t$; because two coordinates are needed to specify the configuration, the system
is said to have two degrees of freedom. Then, at time $t$, the extensions of the three springs are
$x$, $y - x$ and $-y$ respectively. Suppose that the strengths of the three springs are $\alpha$, $2\alpha$ and
$4\alpha$ respectively. Then the three restoring forces are $\alpha x$, $2\alpha(y - x)$, $-4\alpha y$ and the equations of
motion for the two bodies are

\[
\begin{aligned}
m\ddot{x} &= -\alpha x + 2\alpha(y - x), \\
m\ddot{y} &= -2\alpha(y - x) - 4\alpha y,
\end{aligned}
\]

which can be written in the form

\[
\begin{aligned}
\ddot{x} + 3n^2 x - 2n^2 y &= 0, \\
\ddot{y} - 2n^2 x + 6n^2 y &= 0,
\end{aligned} \tag{5.27}
\]

where the positive constant $n$ is defined by $n^2 = \alpha/m$. These are the governing equations
for the motion. They are a pair of simultaneous second order homogeneous linear ODEs with
constant coefficients. The equations are coupled in the sense that both unknown functions
appear in each equation; thus neither equation can be solved on its own.

The solution procedure: normal modes 

The solution procedure is simply an extension of the usual method for finding the
complementary function for a single homogeneous linear ODE with constant coefficients. However,
rather than seek solutions in exponential form, it is simpler to seek solutions directly in the
trigonometric form

\[
\begin{aligned}
x &= A\cos(\omega t - \gamma), \\
y &= B\cos(\omega t - \gamma),
\end{aligned} \tag{5.28}
\]

where $A$, $B$, $\omega$ and $\gamma$ are constants. A solution of the governing equations (5.27) that has
the form (5.28) is called a normal mode of the oscillating system. In a normal mode, all the
coordinates that specify the configuration of the system vary harmonically in time with the
same frequency and the same phase; however, they generally have different amplitudes. On
substituting the normal mode form (5.28) into the governing equations (5.27), we obtain

\[
\begin{aligned}
-\omega^2 A\cos(\omega t - \gamma) + 3n^2 A\cos(\omega t - \gamma) - 2n^2 B\cos(\omega t - \gamma) &= 0, \\
-\omega^2 B\cos(\omega t - \gamma) - 2n^2 A\cos(\omega t - \gamma) + 6n^2 B\cos(\omega t - \gamma) &= 0,
\end{aligned}
\]

which simplifies to give

\[
\begin{aligned}
(3n^2 - \omega^2)A - 2n^2 B &= 0, \\
-2n^2 A + (6n^2 - \omega^2)B &= 0,
\end{aligned} \tag{5.29}
\]

a pair of simultaneous linear algebraic equations for the amplitudes $A$ and $B$. Thus a normal
mode will exist if we can find constants $A$, $B$ and $\omega$ so that the equations (5.29) are satisfied.
Since the equations are homogeneous, they always have the trivial solution $A = B = 0$,
whatever the value of $\omega$. However, the trivial solution corresponds to the equilibrium solution
$x = y = 0$ of the governing equations (5.27), which is not a motion at all. We therefore require
the equations (5.29) to have a non-trivial solution for $A$, $B$. There is a simple condition that
this should be so, namely that the determinant of the system of equations should be zero, that
is

\[
\det\begin{pmatrix} 3n^2 - \omega^2 & -2n^2 \\ -2n^2 & 6n^2 - \omega^2 \end{pmatrix} = 0. \tag{5.30}
\]

On simplification, this gives the condition

\[ \omega^4 - 9n^2\omega^2 + 14n^4 = 0, \tag{5.31} \]

a quadratic equation in the variable $\omega^2$. If this equation has real positive roots $\omega_1^2$, $\omega_2^2$, then,
for each of these values, the linear equations (5.29) will have a non-trivial solution for the
amplitudes $A$, $B$. In the present case, the equation (5.31) factorises and the roots are found to
be

\[ \omega_1^2 = 2n^2, \qquad \omega_2^2 = 7n^2. \tag{5.32} \]

Hence there are two normal modes with (angular) frequencies $\sqrt{2}\,n$ and $\sqrt{7}\,n$ respectively.
These frequencies are known as the normal frequencies of the oscillating system.

Slow mode: In the slow mode we have $\omega^2 = 2n^2$ so that the linear equations (5.29) become

\[
\begin{aligned}
n^2 A - 2n^2 B &= 0, \\
-2n^2 A + 4n^2 B &= 0.
\end{aligned}
\]

These two equations are each equivalent to the single equation $A = 2B$. This is to be expected
since, if the equations were linearly independent, then there would be no non-trivial solution
for $A$ and $B$. We have thus found a family of non-trivial solutions $A = 2\delta$, $B = \delta$, where $\delta$ can
take any (non-zero) value. Thus the amplitude of the normal mode is not uniquely determined;
this happens because the governing ODEs are linear and homogeneous. The slow normal
mode therefore has the form

\[
\begin{aligned}
x &= 2\delta\cos(\sqrt{2}\,nt - \gamma), \\
y &= \delta\cos(\sqrt{2}\,nt - \gamma),
\end{aligned}
\]

where the amplitude factor $\delta$ and phase factor $\gamma$ can take any values. We see that, in the slow
mode, the two bodies always move in the same direction with the body on the left having twice
the amplitude of the body on the right.

Fast mode: In the fast mode we have $\omega^2 = 7n^2$ and, by following the same procedure, we
find that the form of the fast normal mode is

\[
\begin{aligned}
x &= \delta\cos(\sqrt{7}\,nt - \gamma), \\
y &= -2\delta\cos(\sqrt{7}\,nt - \gamma),
\end{aligned}
\]

where the amplitude factor $\delta$ and phase factor $\gamma$ can take any values. We see that, in the fast
mode, the two bodies always move in opposite directions with the body on the right having
twice the amplitude of the body on the left.

The general motion 

Since the governing equations (5.27) are linear and homogeneous, a sum of normal mode
solutions is also a solution. Indeed, the general solution can be written as a sum of normal
modes. Consider the expression

\[
\begin{aligned}
x &= 2\delta_1\cos(\sqrt{2}\,nt - \gamma_1) + \delta_2\cos(\sqrt{7}\,nt - \gamma_2), \\
y &= \delta_1\cos(\sqrt{2}\,nt - \gamma_1) - 2\delta_2\cos(\sqrt{7}\,nt - \gamma_2).
\end{aligned} \tag{5.35}
\]

This is simply a sum of the first normal mode (with amplitude factor $\delta_1$ and phase factor $\gamma_1$)
and the second normal mode (with amplitude factor $\delta_2$ and phase factor $\gamma_2$). Since it is possible
to choose these four arbitrary constants so that $x$, $y$, $\dot{x}$, $\dot{y}$ take any set of assigned values when
$t = 0$, this must be the general solution of the governing equations (5.27).

FIGURE 5.11 The two particles are attached to a light stretched
string and perform small transverse oscillations. The
displacements are shown to be large for clarity.
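The normal mode calculation for a pair of coupled equations can be sketched in a few lines. The routine below assumes the equations have been written in the form $\ddot{x} + a_{11}x + a_{12}y = 0$, $\ddot{y} + a_{21}x + a_{22}y = 0$ with $a_{12} \ne 0$; the function name `normal_modes_2x2` and this coefficient notation are illustrative and do not appear in the text.

```python
import math

def normal_modes_2x2(a11, a12, a21, a22):
    """Normal modes of x'' + a11 x + a12 y = 0, y'' + a21 x + a22 y = 0.

    Returns a list of (omega, ratio) pairs, slow mode first, where ratio = B/A
    is the amplitude ratio in x = A cos(omega t - gamma), y = B cos(omega t - gamma).
    Assumes a12 != 0 and that both roots for omega^2 are real and positive.
    """
    trace = a11 + a22
    det = a11 * a22 - a12 * a21
    disc = trace * trace - 4.0 * det          # discriminant of the quadratic in omega^2
    if disc < 0:
        raise ValueError("no real roots for omega^2")
    root = math.sqrt(disc)
    modes = []
    for w2 in ((trace - root) / 2.0, (trace + root) / 2.0):
        if w2 <= 0:
            raise ValueError("omega^2 is not positive")
        ratio = -(a11 - w2) / a12             # from (a11 - omega^2) A + a12 B = 0
        modes.append((math.sqrt(w2), ratio))
    return modes

if __name__ == "__main__":
    # System (5.27) with n = 1.
    for omega, ratio in normal_modes_2x2(3.0, -2.0, -2.0, 6.0):
        print(omega, ratio)
```

For the system (5.27) with $n = 1$ this reproduces the frequencies $\sqrt{2}$ and $\sqrt{7}$, with amplitude ratios $B/A = 1/2$ in the slow mode and $B/A = -2$ in the fast mode.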

Question Periodicity of the general motion 

Is the general motion periodic? 

Answer 

The general motion is a sum of normal mode motions with periods $\tau_1$, $\tau_2$ respectively.
This sum will be periodic with period $\tau$ if (and only if) $\tau$ is an integer multiple of both
$\tau_1$ and $\tau_2$, that is, if $\tau_1/\tau_2$ is a rational number. (In this case, the periods are said to be
commensurate.) This in turn requires that $\omega_1/\omega_2$ is a rational number. In the present
case, $\omega_1/\omega_2 = (2/7)^{1/2}$, which is irrational. The general motion is therefore not
periodic in this case. ■

We conclude by solving another typical normal mode problem.


Example 5.4 Small transverse oscillations 

Two particles $P$ and $Q$, of masses $2m$ and $m$, are secured to a light string that is
stretched to tension $T_0$ between two fixed supports, as shown in Figure 5.11. The
particles undergo small transverse oscillations perpendicular to the equilibrium line
of the string. Find the normal frequencies, the forms of the normal modes, and the
general motion of this system. Is the general motion periodic?

Solution

First we need to make some simplifying assumptions.* We will assume that the
transverse displacements $x$, $y$ of the two particles are small compared with $a$; the three
sections of the string then make small angles with the equilibrium line. We will also
neglect any change in the tensions of the three sections of string.

The left section of string then has constant tension $T_0$. When the particle $P$ is
displaced, this tension force has the transverse component $-T_0\sin\theta$, which acts as
a restoring force on $P$; since $\theta$ is small, this component is approximately $-T_0 x/a$.
Similar remarks apply to the other sections of string. The equations of transverse
motion for $P$ and $Q$ are therefore

\[
\begin{aligned}
2m\ddot{x} &= -\frac{T_0 x}{a} + \frac{T_0(y - x)}{a}, \\
m\ddot{y} &= -\frac{T_0(y - x)}{a} - \frac{T_0 y}{2a},
\end{aligned} \tag{5.36}
\]

which can be written in the form

\[
\begin{aligned}
2\ddot{x} + 2n^2 x - n^2 y &= 0, \\
2\ddot{y} - 2n^2 x + 3n^2 y &= 0,
\end{aligned} \tag{5.37}
\]

where the positive constant $n$ is defined by $n^2 = T_0/ma$.

These equations will have normal mode solutions of the form

\[
\begin{aligned}
x &= A\cos(\omega t - \gamma), \\
y &= B\cos(\omega t - \gamma),
\end{aligned}
\]

when the simultaneous linear equations

\[
\begin{aligned}
(2n^2 - 2\omega^2)A - n^2 B &= 0, \\
-2n^2 A + (3n^2 - 2\omega^2)B &= 0,
\end{aligned} \tag{5.38}
\]

have a non-trivial solution for the amplitudes $A$, $B$. The condition for this is

\[
\det\begin{pmatrix} 2n^2 - 2\omega^2 & -n^2 \\ -2n^2 & 3n^2 - 2\omega^2 \end{pmatrix} = 0. \tag{5.39}
\]

On simplification, this gives

\[ 2\omega^4 - 5n^2\omega^2 + 2n^4 = 0, \tag{5.40} \]

a quadratic equation in the variable $\omega^2$. This equation factorises and the roots are
found to be

\[ \omega_1^2 = \tfrac{1}{2}n^2, \qquad \omega_2^2 = 2n^2. \tag{5.41} \]

Hence there are two normal modes with normal frequencies $n/\sqrt{2}$ and $\sqrt{2}\,n$
respectively.


* These assumptions are consistent with the more complete treatment given in Chapter 15.

Slow mode: In the slow mode we have $\omega^2 = n^2/2$ so that the linear equations (5.38)
become

\[
\begin{aligned}
n^2 A - n^2 B &= 0, \\
-2n^2 A + 2n^2 B &= 0.
\end{aligned}
\]

These two equations are each equivalent to the single equation $A = B$ so that we have
the family of non-trivial solutions $A = \delta$, $B = \delta$, where $\delta$ can take any (non-zero)
value. The slow normal mode therefore has the form

\[
\begin{aligned}
x &= \delta\cos(nt/\sqrt{2} - \gamma), \\
y &= \delta\cos(nt/\sqrt{2} - \gamma),
\end{aligned}
\]

where the amplitude factor $\delta$ and phase factor $\gamma$ can take any values. We see that, in
the slow mode, the two particles always have the same displacement.

Fast mode: In the fast mode we have $\omega^2 = 2n^2$ and, by following the same
procedure, we find that the form of the fast normal mode is

\[
\begin{aligned}
x &= \delta\cos(\sqrt{2}\,nt - \gamma), \\
y &= -2\delta\cos(\sqrt{2}\,nt - \gamma),
\end{aligned} \tag{5.43}
\]

where the amplitude factor $\delta$ and phase factor $\gamma$ can take any values. We see that,
in the fast mode, the two particles always move in opposite directions with $Q$ having
twice the amplitude of $P$.

The general motion is now the sum of the first normal mode (with amplitude
factor $\delta_1$ and phase factor $\gamma_1$) and the second normal mode (with amplitude factor $\delta_2$
and phase factor $\gamma_2$). This gives

\[
\begin{aligned}
x &= \delta_1\cos(nt/\sqrt{2} - \gamma_1) + \delta_2\cos(\sqrt{2}\,nt - \gamma_2), \\
y &= \delta_1\cos(nt/\sqrt{2} - \gamma_1) - 2\delta_2\cos(\sqrt{2}\,nt - \gamma_2).
\end{aligned} \tag{5.44}
\]

For this system $\tau_1/\tau_2 = \omega_2/\omega_1 = 2$ so that the general motion is periodic with
period $\tau_1 = 2\sqrt{2}\,\pi/n$. ■
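The periodicity of the general motion (5.44) can be confirmed numerically by comparing the displacements at times $t$ and $t + \tau_1$. This is only an illustrative check; the function names `general_motion_ex54` and `period_ex54` and the sample constants are assumptions made for the sketch.

```python
import math

def general_motion_ex54(t, n, d1, g1, d2, g2):
    """Displacements (x, y) from (5.44): slow mode (d1, g1) plus fast mode (d2, g2)."""
    slow = d1 * math.cos(n * t / math.sqrt(2) - g1)
    fast = d2 * math.cos(math.sqrt(2) * n * t - g2)
    return slow + fast, slow - 2.0 * fast

def period_ex54(n):
    """Period tau_1 = 2*sqrt(2)*pi/n of the slow mode, which is also the period
    of the general motion because tau_1 / tau_2 = 2."""
    return 2.0 * math.sqrt(2) * math.pi / n

if __name__ == "__main__":
    n, consts = 1.3, (0.4, 0.2, 0.7, 1.1)   # arbitrary sample values
    T = period_ex54(n)
    print(general_motion_ex54(0.5, n, *consts))
    print(general_motion_ex54(0.5 + T, n, *consts))   # agrees up to rounding
```

The same comparison applied to the motion (5.35) would fail for every candidate period, since there the frequency ratio $(2/7)^{1/2}$ is irrational.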


Problems on Chapter 5 

Answers and comments are at the end of the book. 
Harder problems carry a star (*). 


Free linear oscillations 

5.1 A certain oscillator satisfies the equation

\[ \ddot{x} + 4x = 0. \]

Initially the particle is at the point $x = \sqrt{3}$ when it is projected towards the origin with speed
2. Show that, in the subsequent motion,

\[ x = \sqrt{3}\cos 2t - \sin 2t. \]

Deduce the amplitude of the oscillations. How long does it take for the particle to first reach
the origin?

5.2 When a body is suspended from a fixed point by a certain linear spring, the angular
frequency of its vertical oscillations is found to be $\Omega_1$. When a different linear spring is used,
the oscillations have angular frequency $\Omega_2$. Find the angular frequency of vertical oscillations
when the two springs are used together (i) in parallel, and (ii) in series. Show that the first of
these frequencies is at least twice the second.

5.3 A particle of mass m moves along the x-axis and is acted upon by the restoring force
$-m(n^2 + k^2)x$ and the resistance force $-2mk\dot{x}$, where n, k are positive constants. If the
particle is released from rest at $x = a$, show that, in the subsequent motion,

$$ x = \frac{a}{n}\, e^{-kt} (n \cos nt + k \sin nt). $$

Find how far the particle travels before it next comes to rest. 

5.4 An overdamped harmonic oscillator satisfies the equation

$$ \ddot{x} + 10\dot{x} + 16x = 0. $$

At time $t = 0$ the particle is projected from the point $x = 1$ towards the origin with speed u.
Find x in the subsequent motion.

Show that the particle will reach the origin at some later time t if

$$ \frac{u - 2}{u - 8} = e^{6t}. $$

How large must u be so that the particle will pass through the origin?

5.5 A damped oscillator satisfies the equation

$$ \ddot{x} + 2K\dot{x} + \Omega^2 x = 0 $$

where K and $\Omega$ are positive constants with $K < \Omega$ (under-damping). At time $t = 0$ the
particle is released from rest at the point $x = a$. Show that the subsequent motion is given by

$$ x = a e^{-Kt} \left( \cos \Omega_D t + \frac{K}{\Omega_D} \sin \Omega_D t \right), $$

where $\Omega_D = (\Omega^2 - K^2)^{1/2}$.

Find all the turning points of the function x(t) and show that the ratio of successive
maximum values of x is $e^{-2\pi K/\Omega_D}$.

A certain damped oscillator has mass 10 kg, period 5 s and successive maximum values
of its displacement are in the ratio 3 : 1. Find the values of the spring and damping constants
$\alpha$ and $\beta$.

5.6 Critical damping Find the general solution of the damped SHM equation (5.9) for the
special case of critical damping, that is, when $K = \Omega$. Show that, if the particle is initially
released from rest at $x = a$, then the subsequent motion is given by

$$ x = a e^{-\Omega t} (1 + \Omega t). $$


Sketch the graph of x against t. 

5.7 * Fastest decay The oscillations of a galvanometer satisfy the equation

$$ \ddot{x} + 2K\dot{x} + \Omega^2 x = 0. $$

The galvanometer is released from rest with $x = a$ and we wish to bring the reading permanently
within the interval $-\epsilon a \le x \le \epsilon a$ as quickly as possible, where $\epsilon$ is a small positive
constant. What value of K should be chosen? One possibility is to choose a sub-critical value
of K such that the first minimum point of x(t) occurs when $x = -\epsilon a$. [Sketch the graph of
x(t) in this case.] Show that this can be achieved by setting the value of K to be

$$ K = \Omega \left[ 1 + \left( \frac{\pi}{\ln(1/\epsilon)} \right)^2 \right]^{-1/2}. $$

If K has this value, show that the time taken for x to reach its first minimum is approximately
$\Omega^{-1} \ln(1/\epsilon)$ when $\epsilon$ is small.

5.8 A block of mass M is connected to a second block of mass m by a linear spring of natural
length 8a. When the system is in equilibrium with the first block on the floor, and with the
spring and second block vertically above it, the length of the spring is 7a. The upper block is
then pressed down until the spring has half its natural length and is then released from rest.
Show that the lower block will leave the floor if $M < 2m$. For the case in which $M = 3m/2$,
find when the lower block leaves the floor.

Driven linear oscillations 

5.9 A block of mass 2 kg is suspended from a fixed support by a spring of strength
2000 N m$^{-1}$. The block is subject to the vertical driving force $36 \cos pt$ N. Given that the
spring will yield if its extension exceeds 4 cm, find the range of frequencies that can safely be
applied.

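5.10 A driven oscillator satisfies the equation

$$ \ddot{x} + \Omega^2 x = F_0 \cos[\Omega(1 + \epsilon)t], $$

where $\epsilon$ is a positive constant. Show that the solution that satisfies the initial conditions $x = 0$
and $\dot{x} = 0$ when $t = 0$ is

$$ x = \frac{2F_0}{\epsilon(2 + \epsilon)\Omega^2} \, \sin\left(\tfrac{1}{2}\epsilon\Omega t\right) \sin\left(\Omega\left(1 + \tfrac{1}{2}\epsilon\right)t\right). $$

Sketch the graph of this solution for the case in which $\epsilon$ is small.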

5.11 Figure 5.12 shows a simple model of a car moving with constant speed c along a gently
undulating road with profile h(x), where h'(x) is small. The car is represented by a chassis
which keeps contact with the road, connected to an upper mass m by a spring and a damper.
At time t the upper mass has displacement y(t) above its equilibrium level. Show that, under
suitable assumptions, y satisfies a differential equation of the form

$$ \ddot{y} + 2K\dot{y} + \Omega^2 y = 2Kc\,h'(ct) + \Omega^2 h(ct) $$

where K and $\Omega$ are positive constants.

Suppose that the profile of the road surface is given by $h(x) = h_0 \cos(px/c)$, where $h_0$
and p are positive constants. Find the amplitude a of the driven oscillations of the upper mass.
The vehicle designer adjusts the damper so that $K = \Omega$. Show that

$$ a \le \frac{2}{\sqrt{3}}\, h_0, $$

whatever the values of the constants $\Omega$ and p.

5.12 Solution by Fourier series A driven oscillator satisfies the equation

$$ \ddot{x} + 2K\dot{x} + \Omega^2 x = F(t), $$

where K and $\Omega$ are positive constants. Find the driven response of the oscillator to the
'saw tooth' input, that is, when F(t) is given by

$$ F(t) = F_0\, t \qquad (-\pi < t < \pi) $$

and F(t) is periodic with period $2\pi$. [It is a good idea to sketch the graph of the function
F(t).]

Non-linear oscillations that are piecewise linear 

5.13 A particle of mass m is connected to a fixed point O on a smooth horizontal table by
a linear elastic string of natural length 2a and strength $m\Omega^2$. Initially the particle is released
from rest at a point on the table whose distance from O is 3a. Find the period of the resulting
oscillations.


5.14 Coulomb friction The displacement x of a spring mounted mass under the action of
Coulomb friction satisfies the equation

$$ \ddot{x} + \Omega^2 x = \begin{cases} -F_0 & \dot{x} > 0 \\ \phantom{-}F_0 & \dot{x} < 0 \end{cases} $$

where $\Omega$ and $F_0$ are positive constants. If $|x| > F_0/\Omega^2$ when $\dot{x} = 0$, then the motion
continues; if $|x| \le F_0/\Omega^2$ when $\dot{x} = 0$, then the motion ceases. Initially the body is released
from rest with $x = 9F_0/2\Omega^2$. Find where it finally comes to rest. How long was the body in
motion?

5.15 A partially damped oscillator satisfies the equation

$$ \ddot{x} + 2\kappa\dot{x} + \Omega^2 x = 0, $$

where $\Omega$ is a positive constant and $\kappa$ is given by

$$ \kappa = \begin{cases} 0 & x < 0 \\ K & x > 0 \end{cases} $$

where K is a positive constant such that $K < \Omega$. Find the period of the oscillator and the ratio
of successive maximum values of x.

Normal modes 

5.16 A particle P of mass 3m is suspended from a fixed point O by a light linear spring with
strength $\alpha$. A second particle Q of mass 2m is in turn suspended from P by a second spring of
the same strength. The system moves in the vertical straight line through O. Find the normal
frequencies and the form of the normal modes for this system. Write down the form of the
general motion.

5.17 Two particles P and Q, each of mass m, are secured at the points of trisection of a
light string that is stretched to tension $T_0$ between two fixed supports a distance 3a apart. The
particles undergo small transverse oscillations perpendicular to the equilibrium line of the
string. Find the normal frequencies, the forms of the normal modes, and the general motion of
this system. [Note that the forms of the modes could have been deduced from the symmetry
of the system.] Is the general motion periodic?

5.18 A particle P of mass 3m is suspended from a fixed point O by a light inextensible string
of length a. A second particle Q of mass m is in turn suspended from P by a second string of
length a. The system moves in a vertical plane through O. Show that the linearised equations
of motion for small oscillations near the downward vertical are

$$ 4\ddot{\theta} + \ddot{\phi} + 4n^2\theta = 0, $$
$$ \ddot{\theta} + \ddot{\phi} + n^2\phi = 0, $$

where $\theta$ and $\phi$ are the angles that the two strings make with the downward vertical, and
$n^2 = g/a$. Find the normal frequencies and the forms of the normal modes for this system.

KEY FEATURES

The key features of this chapter are the energy principle for a particle, conservative fields of 
force, potential energies and energy conservation. 

In this Chapter, we introduce the notion of mechanical energy and its conservation. 
Although energy methods are never indispensable* for the solution of problems, they do give
a greater insight and allow many problems to be solved in a quick and elegant manner. Energy 
has a fundamental role in the Lagrangian and Hamiltonian formulations of mechanics. More 
generally, the notion of energy has been so widely extended that energy conservation has 
become the most pervasive and important principle in the whole of physics. 


6.1 THE ENERGY PRINCIPLE 


Suppose a particle P of mass m moves under the influence of a force F. Then its
equation of motion is

$$ m \frac{d\mathbf{v}}{dt} = \mathbf{F}, \tag{6.1} $$

where v is the velocity of P at time t. At this stage we place no restrictions on the force F.
It may depend on the position of P, the velocity of P, the time, or anything else; if more than
one force is acting on P, then F means the vector resultant of these forces. On taking the
scalar product of both sides of equation (6.1) with v, we obtain the scalar equation

$$ m \mathbf{v} \cdot \frac{d\mathbf{v}}{dt} = \mathbf{F} \cdot \mathbf{v} $$

and, since

$$ \frac{d}{dt}(\mathbf{v} \cdot \mathbf{v}) = 2\, \mathbf{v} \cdot \frac{d\mathbf{v}}{dt}, $$

this can be written in the form

$$ \frac{dT}{dt} = \mathbf{F} \cdot \mathbf{v}, \tag{6.2} $$

where $T = \tfrac{1}{2} m \mathbf{v} \cdot \mathbf{v}$.

* Energy is never mentioned in the work of Newton!

Definition 6.1 Kinetic energy The scalar quantity $T = \tfrac{1}{2} m \mathbf{v} \cdot \mathbf{v} = \tfrac{1}{2} m |\mathbf{v}|^2$ is called the
kinetic energy of the particle P.


If we now integrate equation (6.2) over the time interval $[t_1, t_2]$, we obtain

$$ T_2 - T_1 = \int_{t_1}^{t_2} \mathbf{F} \cdot \mathbf{v} \, dt \tag{6.3} $$

where $T_1$ and $T_2$ are the kinetic energies of P at times $t_1$ and $t_2$ respectively. This is the energy
principle for a particle moving under a force F.

Definition 6.2 1-D work done The scalar quantity

$$ W = \int_{t_1}^{t_2} \mathbf{F} \cdot \mathbf{v} \, dt \tag{6.4} $$

is called the work done by the force F during the time interval $[t_1, t_2]$. The rate of working of
F at time t is thus $\mathbf{F} \cdot \mathbf{v}$.

[The SI unit of work is the joule (J) and one joule per second is one watt (W).]
Our result can now be stated as follows:


Energy principle for a particle 

In any motion of a particle, the increase in the kinetic energy of the particle in a given
time interval is equal to the total work done by the applied forces during this time interval.


The energy principle is a scalar equality which is derived by integrating the vector equa¬ 
tion of motion (6.1). Thus the energy principle will generally contain less information than 
the equation of motion, so that we have no right to expect the motion of P to be determined 
from the energy principle alone. The situation is simpler when P has one degree of freedom, 
which means that the position of P can be specified by a single scalar variable. In this case 
the equation of motion and the energy principle are equivalent and the energy principle alone
is sufficient to determine the motion. 

Example 6.1 Verify the energy principle 

A man of mass 100 kg can pull on a rope with a maximum force equal to one fifth
of his own weight. [Take $g = 10$ m s$^{-2}$.] In a competition, he must pull a block of
mass 1600 kg across a smooth horizontal floor, the block being initially at rest. He is
able to apply his maximum force horizontally for 12 seconds before falling exhausted.
Find the total work done by the man and confirm that the energy principle is true in
this case.

Solution

In this problem, the block is subjected to three forces: the force exerted by the man,
uniform gravity, and the vertical reaction of the smooth floor. However, since the last
two of these are equal and opposite, they can be ignored.

The man has weight 1000 N so that the force he applies to the block is a constant
200 N. The Second Law then implies that, while the man is pulling on the rope, the
block must have constant rectilinear acceleration 200/1600 = 1/8 m s$^{-2}$. Since the
block is initially at rest, its velocity v at time t is therefore $v = t/8$ m s$^{-1}$. The total
work W done by the man is then given by the formula (6.4) to be

$$ W = \int_0^{12} 200 \left( \frac{t}{8} \right) dt = 1800 \text{ J}. $$

When t = 12 s, the block has velocity v = 12/8 = 3/2 m s$^{-1}$, so that the final kinetic
energy of the block is $\tfrac{1}{2}(1600)(3/2)^2 = 1800$ J. Since the initial kinetic energy of the
block is zero, the kinetic energy of the block increases by 1800 J, the same as the
work done by the man. This confirms the truth of the energy principle. ■
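The arithmetic of Example 6.1 can be reproduced with a short script. This is only an illustrative sketch: it takes the 200 N pull, the 1600 kg block and the 12 s duration from the worked solution, and evaluates the work integral (6.4) by the trapezium rule; the variable and function names are hypothetical.

```python
# Data used in the solution of Example 6.1 (SI units).
block_mass = 1600.0      # kg
force = 200.0            # N, the constant pull used in the solution
duration = 12.0          # s

def velocity(t):
    """Block velocity at time t: constant acceleration from rest."""
    return (force / block_mass) * t

# Work done, equation (6.4): integrate F*v over [0, duration] by the trapezium rule.
steps = 1200
dt = duration / steps
work = sum(
    0.5 * (force * velocity(k * dt) + force * velocity((k + 1) * dt)) * dt
    for k in range(steps)
)

# Increase in kinetic energy (the block starts from rest).
kinetic_gain = 0.5 * block_mass * velocity(duration) ** 2
print(work, kinetic_gain)
```

Because the integrand is linear in t, the trapezium rule is exact here apart from rounding, so the two printed numbers should agree closely.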



6.2 ENERGY CONSERVATION IN RECTILINEAR MOTION 

The energy principle is not normally used in the general form (6.3). When possible, it 
is transformed into a conservation principle. This is most easily illustrated by the special case 
of rectilinear motion. 

Suppose that the particle P moves along the x-axis under the force F acting in the positive
x-direction. In this case, the 'work done' integral (6.4) reduces to

$$ W = \int_{t_1}^{t_2} F v \, dt, $$

where $v = \dot{x}$ is the velocity of P in the positive x-direction. For the case in which F is a force
field (so that F = F(x)), the formula for W becomes

$$ W = \int_{t_1}^{t_2} F(x) \frac{dx}{dt} \, dt = \int_{x_1}^{x_2} F(x) \, dx, $$

where $x_1 = x(t_1)$ and $x_2 = x(t_2)$. Thus, when P moves over the interval $[x_1, x_2]$ of the x-axis,
the work done by the field F is given by

$$ W = \int_{x_1}^{x_2} F(x) \, dx \tag{6.5} $$

(This is a common definition of the work done by a force F. It can be used when F = F(x),
but not in general.) It follows that the energy principle for a particle moving in a rectilinear
force field can be written

$$ T_2 - T_1 = \int_{x_1}^{x_2} F(x) \, dx. $$

Now let V(x) be the indefinite integral of $-F(x)$, so that

$$ F = -\frac{dV}{dx} \qquad \text{and} \qquad \int_{x_1}^{x_2} F(x) \, dx = V(x_1) - V(x_2). \tag{6.6} $$

Such a V is called the potential energy* function of the force field F. In terms of V, the
energy principle in rectilinear motion can be written

$$ T_2 + V(x_2) = T_1 + V(x_1), $$

which is equivalent to the energy conservation formula

$$ T + V = E \tag{6.7} $$


where E is a constant called the total energy of the particle. This result can be stated as 
follows: 


Energy conservation in rectilinear motion 

When a particle undergoes rectilinear motion in a force field, the sum of its kinetic and 
potential energies remains constant in the motion. 


Example 6.2 Finding potential energies 

Find the potential energies of (i) the (one-dimensional) SHM force field, (ii) the (one¬ 
dimensional) attractive inverse square force field. 

Solution 

(i) The one-dimensional SHM force field is $F = -\alpha x$, where $\alpha$ is a positive constant.
The corresponding V is given by

$$ V = -\int_a^x F(x) \, dx = \alpha \int_a^x x \, dx, $$

where a, the lower limit of integration, can be arbitrarily chosen. (This corresponds
to the arbitrary choice of the constant of integration.) Note that, by beginning the
integration at $x = a$, we make V(a) = 0. In the present case it is conventional to take
a = 0 so that V = 0 at x = 0. With this choice, the potential energy is $V = \tfrac{1}{2}\alpha x^2$.

(ii) The one-dimensional attractive inverse square force field is $F = -K/x^2$, where
x > 0 and K is a positive constant. The corresponding V is given by

$$ V = -\int_a^x F(x) \, dx = K \int_a^x \frac{dx}{x^2}. $$

This time it is not possible to take a = 0 (the integral would then be meaningless)
and it is conventional to take $a = +\infty$; this makes V = 0 when $x = +\infty$. With this
choice, the potential energy is $V = -K/x$ (x > 0). ■

* The potential energy corresponding to a given F is uniquely determined apart from a constant of
integration; this constant has no physical significance.

Example 6.3 Rectilinear motion under uniform gravity 


A particle P is projected vertically upwards with speed u and moves under uniform 
gravity. Find the maximum height achieved and the speed of P when it returns to its 
starting point. 

Solution 

Suppose that P is projected from the origin and moves along the z-axis, where Oz
points vertically upwards. The force F exerted by uniform gravity is $F = -mg$ and
the corresponding potential energy V is given by

$$ V = -\int_0^z (-mg) \, dz = mgz. $$

Energy conservation then implies that

$$ \tfrac{1}{2} m v^2 + mgz = E, $$

where $v = \dot{z}$, and the constant E is determined from the initial condition v = u when
z = 0. This gives $E = \tfrac{1}{2} m u^2$ so that the energy conservation equation for the motion
is

$$ \tfrac{1}{2} m v^2 + mgz = \tfrac{1}{2} m u^2. $$

Since v = 0 when $z = z_{\max}$, it follows that $z_{\max} = u^2/(2g)$. This result was obtained
from the Second Law in Chapter 4. When P returns to O, z = 0 and so $|v| = u$. Thus
P returns to O with speed u, the projection speed. ■

Example 6.4 Simple harmonic motion 

A particle of mass m is projected from the point x = a with speed u and moves along
the x-axis under the SHM force field $F = -m\omega^2 x$. Find the maximum distance from
O and the maximum speed achieved by the particle in the subsequent motion.

Solution

The potential energy corresponding to the force field $F = -m\omega^2 x$ is $V = \tfrac{1}{2} m \omega^2 x^2$.
Energy conservation then implies that

$$ \tfrac{1}{2} m v^2 + \tfrac{1}{2} m \omega^2 x^2 = E, $$

where $v = \dot{x}$, and the constant E is determined from the initial condition $|v| = u$
when x = a. This gives $E = \tfrac{1}{2} m (u^2 + \omega^2 a^2)$ so that the energy conservation equation
for the motion becomes

$$ v^2 + \omega^2 x^2 = u^2 + \omega^2 a^2. $$

Since v = 0 when |x| takes its maximum value, it follows that

$$ |x|_{\max} = \left( \frac{u^2}{\omega^2} + a^2 \right)^{1/2}. $$

Also, since the left side of the energy conservation equation is the sum of two positive
terms, it follows that |v| takes its maximum value when x = 0. Hence

$$ |v|_{\max} = \left( u^2 + \omega^2 a^2 \right)^{1/2}. $$

These results could also be obtained (less quickly) by using the methods described in
Chapter 5. ■
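The two results of Example 6.4 can be compared with the explicit SHM solution. The sketch below is illustrative only: the numerical values of $\omega$, a and u are arbitrary assumptions, the particle is assumed to be projected in the positive x-direction (the energy argument does not depend on this choice), and the maxima are found by sampling one period.

```python
import math

# Illustrative values (assumptions, not taken from the text).
omega, a, u = 2.0, 0.5, 3.0

def x(t):
    """SHM solution with x(0) = a and initial velocity +u."""
    return a * math.cos(omega * t) + (u / omega) * math.sin(omega * t)

def v(t):
    """Velocity corresponding to x(t)."""
    return -a * omega * math.sin(omega * t) + u * math.cos(omega * t)

# Sample one full period finely and record the largest |x| and |v|.
period = 2 * math.pi / omega
times = [period * k / 100000 for k in range(100000)]
x_max_sampled = max(abs(x(t)) for t in times)
v_max_sampled = max(abs(v(t)) for t in times)

# Values predicted by energy conservation.
x_max_formula = math.sqrt(u**2 / omega**2 + a**2)
v_max_formula = math.sqrt(u**2 + omega**2 * a**2)
print(x_max_sampled, x_max_formula)
print(v_max_sampled, v_max_formula)
```

The energy method gives both extreme values without solving the equation of motion, which is the point made in the closing remark of the example.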


6.3 GENERAL FEATURES OF RECTILINEAR MOTION 

The energy conservation equation 

$$ \tfrac{1}{2} m v^2 + V(x) = E \tag{6.8} $$

enables us to deduce the general features of rectilinear motion in a force field. Since $T \ge 0$
(and is equal to zero only when v = 0) it follows that the position of the particle is restricted
to those values of x that satisfy

$$ V(x) \le E, $$

and that equality will occur only when v = 0. Suppose that V(x) has the form shown in Figure
6.1 and that E has the value shown. Then the motion of P must take place either (i) in the
bounded interval $a \le x \le b$, or (ii) in the unbounded interval $c \le x < \infty$. Thus, if the particle
was situated in the interval [a, b] initially, this is the interval in which the motion will take
place.

Bounded motions 

Suppose that the motion is started with P in the interval [a, b] and with v positive, so that P is
moving to the right. Then, since v can only be zero at x = a and x = b, v will remain positive
until P reaches the point x = b, where it comes to rest*. From equation (6.8), it follows that
the ODE that governs this 'right' part of the motion is

$$ \frac{dx}{dt} = +\left[ \frac{2}{m} \bigl( E - V(x) \bigr) \right]^{1/2}. $$

FIGURE 6.1 Bounded and unbounded motions in a rectilinear force field. [Graph of V against x
with the energy level E marked.]

At the point x = b, V' > 0 which implies that F < 0. P therefore moves to the left and does
not stop until it reaches the point x = a. The ODE that governs this 'left' part of the motion is

$$ \frac{dx}{dt} = -\left[ \frac{2}{m} \bigl( E - V(x) \bigr) \right]^{1/2}. $$

At the point x = a, V' < 0, which implies that F > 0 and that P moves to the right once
again. The result is that P performs periodic oscillations between the extreme points x = a
and x = b. Since the 'left' and 'right' parts of the motion take equal times, the period $\tau$ of
these oscillations can be found by integrating either equation over the interval $a \le x \le b$.
Each equation is a separable ODE and integration gives

$$ \tau = 2 \int_a^b \frac{dx}{\left[ \frac{2}{m} \bigl( E - V(x) \bigr) \right]^{1/2}}. $$

It should be noted that these oscillations are generally not simple harmonic. In particular, their
period is amplitude dependent.

* Strictly speaking, we should exclude the possibility that P might approach the point x = b asymptotically
as $t \to \infty$, and never actually get there. This can happen, but only in the case in which the line V = E
is a tangent to the graph of V(x) at x = b. In the general case depicted in Figure 6.1, P does arrive at
x = b in a finite time.

Example 6.5 Periodic oscillations 

A particle P of mass 2 moves on the positive x-axis under the force field
$F = (4/x^2) - 1$. Initially P is released from rest at the point x = 4. Find the extreme
points and the period of the motion.

Solution

The force field F has potential energy $V = (4/x) + x$, so that the energy conservation
equation for P is

$$ \tfrac{1}{2}(2) v^2 + (4/x) + x = E, $$

where $v = \dot{x}$ and E is the total energy. The initial condition v = 0 when x = 4 gives
E = 5 so that

$$ v^2 = 5 - (4/x) - x. $$

The extreme points of the motion occur when v = 0, that is, when x = 1 and x = 4.
To find the period $\tau$ of the oscillations, write v = dx/dt in the last equation and take
square roots. This gives the separable ODEs

$$ \frac{dx}{dt} = \pm \left[ \frac{(x - 1)(4 - x)}{x} \right]^{1/2}, $$

where the plus and minus signs refer to the motion of P in the positive and negative
x-directions respectively. Integration of either equation gives

$$ \tau = 2 \int_1^4 \left[ \frac{x}{(x - 1)(4 - x)} \right]^{1/2} dx \approx 9.69. \; ■ $$
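The value of the period integral in Example 6.5 can be checked numerically. The sketch below is not from the text: it assumes the substitution $x = \tfrac{5}{2} - \tfrac{3}{2}\cos\theta$ (chosen here to remove the endpoint singularities at x = 1 and x = 4), under which $(x-1)(4-x) = \tfrac{9}{4}\sin^2\theta$ and the integral becomes $\int_0^\pi \sqrt{x(\theta)}\, d\theta$. The helper names are hypothetical.

```python
import math

def integrand(theta):
    # After x = 5/2 - (3/2)cos(theta), the integrand
    # [x / ((x - 1)(4 - x))]^(1/2) dx reduces to sqrt(x(theta)) dtheta.
    return math.sqrt(2.5 - 1.5 * math.cos(theta))

def simpson(f, lo, hi, n=1000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    total += 4 * sum(f(lo + (2 * k - 1) * h) for k in range(1, n // 2 + 1))
    total += 2 * sum(f(lo + 2 * k * h) for k in range(1, n // 2))
    return total * h / 3

# Period = twice the time taken to travel from x = 1 to x = 4.
tau = 2 * simpson(integrand, 0.0, math.pi)
print(tau)
```

The printed value should agree with the figure quoted in the example to two decimal places.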


Unbounded motions 

Suppose now that the motion is started with P in the interval [c, oo) and with v negative, so 
that P is moving to the left. Then, since v can only be zero at x = c, v will remain negative 
until P reaches the point x = c, where it comes to rest. At the point x = c, V' < 0, which
implies that F > 0. P therefore moves to the right and continues to do so indefinitely. 


Stable equilibrium and small oscillations 

First, we define what we mean by an equilibrium position. 

Definition 6.3 Equilibrium The point A is said to be an equilibrium position of P if, 

when P is released from rest at A, P remains at A. 

In the case of rectilinear motion under a force field F(x), the point x = a will be an
equilibrium position of P if (and only if) F(a) = 0, that is, if V'(a) = 0. It follows that
the equilibrium positions of P are the stationary points of the potential energy function V(x).
Consider the equilibrium positions shown in Figure 6.2. These occur at stationary points of V
that are a minimum and a maximum respectively. Suppose that P is at rest at the minimum
point x = a when it receives an impulse of magnitude J which gives it kinetic energy $\Delta E$
($= J^2/2m$). The total energy of P is now $E + \Delta E$, and so P will oscillate in the interval
$[a - \delta_1, a + \delta_2]$ shown. It is clear from Figure 6.2 that, as the magnitude of J (and therefore
$\Delta E$) tends to zero, the 'amplitude' $\delta$ of the resulting motion (the larger of $\delta_1$ and $\delta_2$) also tends
to zero. This is the definition of stable equilibrium.

Definition 6.4 Stable equilibrium Suppose that a particle P is in equilibrium at the point
A when it receives an impulse of magnitude J; let $\delta$ be the amplitude of the subsequent motion.
If $\delta \to 0$ as $J \to 0$, then the point A is said to be a position of stable equilibrium of P.

On the other hand, if P is at rest at the maximum point x = b when it receives an impulse
of magnitude J, it is clear that the amplitude of the resulting motion does not tend to zero as
J tends to zero, so that a maximum point of V(x) is not a position of stable equilibrium of P.
The same applies to stationary inflection points.


Equilibrium positions of a particle 

The stationary points of the potential energy V(x) are the equilibrium points of P and
the minimum points of V(x) are the positions of stable equilibrium. If A is a position of
stable equilibrium, then P can execute small-amplitude oscillations about A. 


Approximate equation of motion for small oscillations 

Suppose that the point x = a is a minimum point of the potential energy V(x). Then, when x
is sufficiently close to a, we may approximate V(x) by the first three terms of its Taylor series
in powers of the variable (x - a), as follows:

$$ V(x) = V(a) + (x - a) V'(a) + \tfrac{1}{2} (x - a)^2 V''(a) $$
$$ \phantom{V(x)} = V(a) + \tfrac{1}{2} (x - a)^2 V''(a), \tag{6.9} $$

since V'(a) = 0. Thus, for small amplitude oscillations about x = a, the energy conservation
equation is approximately

$$ \tfrac{1}{2} m v^2 + V(a) + \tfrac{1}{2} (x - a)^2 V''(a) = E. $$

If we now differentiate this equation with respect to t (and divide by v), we obtain the
approximate (linearised) equation of motion

$$ m \frac{d^2 x}{dt^2} + V''(a)(x - a) = 0. $$

Provided that V''(a) > 0, this is the equation for simple harmonic oscillations with angular
frequency $(V''(a)/m)^{1/2}$ about the point x = a. The small oscillations of P about x = a are
therefore approximately simple harmonic with approximate period $\tau = 2\pi (m/V''(a))^{1/2}$.

Example 6.6 Finding the period of small oscillations 

A particle P of mass 8 moves on the x-axis under the force field whose potential
energy is

$$ V = \frac{x (x - 3)^2}{3}. $$

Show that there is a single position of stable equilibrium and find the approximate
period of small oscillations about this point.

Solution

For this V, $V' = x^2 - 4x + 3$ and $V'' = 2x - 4$. The equilibrium positions occur when
V' = 0, that is when x = 1 and x = 3. Since V''(1) = -2 and V''(3) = 2, we deduce
that the only position of stable equilibrium is at x = 3. The approximate period $\tau$ of
small oscillations about this point is therefore given by $\tau = 2\pi (8/V''(3))^{1/2} = 4\pi$. ■

FIGURE 6.3 Particle P is in general motion under the force F. The arc C is the path taken
by P between the points A and B.
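The small-oscillation period found in Example 6.6 can be compared with a direct numerical integration of the full (non-linear) equation of motion. The following is a rough sketch under stated assumptions: the release amplitude 0.01, the time step and the velocity-Verlet integrator are choices made here for illustration, and the function names are hypothetical.

```python
import math

MASS = 8.0

def force(x):
    """F = -V'(x) for V = x(x - 3)^2 / 3, so that V' = x^2 - 4x + 3."""
    return -(x * x - 4.0 * x + 3.0)

def measured_period(amplitude, dt=1e-3, t_max=100.0):
    """Release P from rest at x = 3 + amplitude and time its return to rest there."""
    x, v, t = 3.0 + amplitude, 0.0, 0.0
    while t < t_max:
        # One velocity-Verlet step.
        v_half = v + 0.5 * dt * force(x) / MASS
        x_new = x + dt * v_half
        v_new = v_half + 0.5 * dt * force(x_new) / MASS
        # Velocity passing from positive to non-positive marks the next maximum of x.
        if v > 0.0 and v_new <= 0.0:
            return t + dt * v / (v - v_new)   # linear interpolation in time
        x, v, t = x_new, v_new, t + dt
    raise RuntimeError("no complete oscillation found")

tau_numeric = measured_period(0.01)
tau_linear = 4.0 * math.pi
print(tau_numeric, tau_linear)
```

For a small release amplitude the two values should nearly coincide; increasing the amplitude makes the measured period drift away from $4\pi$, in line with the remark that the period of such oscillations is amplitude dependent.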

6.4 ENERGY CONSERVATION IN A CONSERVATIVE FIELD 

Suppose now that the particle P is in general three-dimensional motion under the
force F and that, in the time interval $[t_A, t_B]$, P moves from the point A to the point B along
the path C, as shown in Figure 6.3. Then, by the energy principle (6.3),

$$ T_B - T_A = \int_{t_A}^{t_B} \mathbf{F} \cdot \mathbf{v} \, dt, \tag{6.10} $$

where $T_A$ and $T_B$ are the kinetic energies of P when $t = t_A$ and $t = t_B$ respectively. When
F is a force field F(r), the 'work done' integral on the right side of equation (6.10) can be
written in the form

$$ \int_{t_A}^{t_B} \mathbf{F} \cdot \mathbf{v} \, dt = \int_{t_A}^{t_B} \mathbf{F}(\mathbf{r}) \cdot \frac{d\mathbf{r}}{dt} \, dt = \int_C \mathbf{F}(\mathbf{r}) \cdot d\mathbf{r}, $$

where C is the path taken by P in the time interval $[t_A, t_B]$. It follows that the energy principle
for a particle moving in a 3D force field can be written

$$ T_B - T_A = \int_C \mathbf{F}(\mathbf{r}) \cdot d\mathbf{r}. \tag{6.11} $$

Integrals like that on the right side of equation (6.11) are called line integrals. They differ
from ordinary integrals in that the range of integration is not an interval of the x-axis, but a
path in three-dimensional space. Line integrals are treated in detail in texts on vector field
theory (see for example Schey [11]), but their physical meaning in the present context is clear
enough. The quantity $\mathbf{F} \cdot d\mathbf{r}$ is the infinitesimal work done by F when P traverses the element
$d\mathbf{r}$ of the path C. The line integral sums these contributions to give the total work done by F.
The line integral of F along C is taken to be the definition of the work done by the force
field F(r) when its point of application moves along any path C that connects A and B.

Definition 6.5 3-D work done The expression

$$ W[A \to B; C] = \int_C \mathbf{F}(\mathbf{r}) \cdot d\mathbf{r} \tag{6.12} $$

is called the work done by the force field F(r) when its point of application moves from A to
B along any path C.

The above definition is more than just an alternative definition of the work done by a force field acting
on a particle. It defines the quantity $W[A \to B; C]$ whether or not C is an actual path traversed by the
particle P. In this wider sense, the concept of 'work done' is purely notional, as is the concept of the 'point
of application of the force'. In this sense, W exists for all paths joining the points A and B, but W should
be regarded as the real work done by F only when C is an actual path of a particle moving under the field
F(r).

Energy conservation 

In order to develop an energy conservation principle for the general three-dimensional
case, we need the right side of equation (6.11) to be expressible in the form

$$ \int_C \mathbf{F}(\mathbf{r}) \cdot d\mathbf{r} = V_A - V_B, \tag{6.13} $$

for some scalar function of position V(r), where $V_A$ and $V_B$ are the values of V at the
points A and B. In the rectilinear case, there was no difficulty in finding such a V (it was
the indefinite integral of $-F(x)$). In the general case however, it is far from clear that any
such V should exist. For, if there does exist a function V(r) satisfying equation (6.13),
this must mean that the line integral $W[A \to B; C]$ has the same value for all paths
C that connect the points A and B. There is no reason why this should be true and, in
general, it is not true. There is however an important class of fields F(r) for which V(r)
does exist, and it is these fields that we shall consider from now on.

Definition 6.6 Conservative field If the field F(r) can be expressed in the form*

$$ \mathbf{F} = -\operatorname{grad} V, \tag{6.14} $$

where V(r) is a scalar function of position, then F is said to be a conservative field and
the function V is said to be the potential energy function† for F.


* If $\psi(\mathbf{r})$ is a scalar field then grad $\psi$ is the vector field defined by

$$ \operatorname{grad} \psi = \frac{\partial \psi}{\partial x} \mathbf{i} + \frac{\partial \psi}{\partial y} \mathbf{j} + \frac{\partial \psi}{\partial z} \mathbf{k}. $$

Thus if $\psi = x y^3 z^5$, then $\operatorname{grad} \psi = y^3 z^5 \mathbf{i} + 3 x y^2 z^5 \mathbf{j} + 5 x y^3 z^4 \mathbf{k}$. We could omit the minus sign in the
definition (6.14), but the potential energy of F would then be $-V$ instead of V.

† If V exists, then it is unique apart from a constant of integration. As in the rectilinear case, this constant
has no physical significance.

Example 6.7 Conservative or not conservative?

Show that the field $\mathbf{F}_1 = -2x\mathbf{i} - 2y\mathbf{j} - 2z\mathbf{k}$ is conservative but that the field
$\mathbf{F}_2 = y\mathbf{i} - x\mathbf{j}$ is not.

Solution

(i) If $\mathbf{F}_1$ is conservative, then its potential energy V must satisfy

$$ -\frac{\partial V}{\partial x} = -2x, \qquad -\frac{\partial V}{\partial y} = -2y, \qquad -\frac{\partial V}{\partial z} = -2z, $$

and these equations integrate to give

$$ V = x^2 + p(y, z), \qquad V = y^2 + q(x, z), \qquad V = z^2 + r(x, y), $$

where p, q and r are 'constants' of integration, which, in this case, are functions of the
other variables. If V really exists, then these three representations of V can be made
identical by making special choices of the functions p, q and r. In this example it is
clear that this can be achieved by taking $p = y^2 + z^2$, $q = x^2 + z^2$, and $r = x^2 + y^2$.
Hence $\mathbf{F}_1 = -\operatorname{grad}(x^2 + y^2 + z^2)$ and so $\mathbf{F}_1$ is conservative.

(ii) If $\mathbf{F}_2$ is conservative, then its potential energy V must satisfy

$$ -\frac{\partial V}{\partial x} = y, \qquad -\frac{\partial V}{\partial y} = -x, \qquad -\frac{\partial V}{\partial z} = 0. $$

There is no V that satisfies these equations simultaneously. The easiest way to show
this is to observe that, from the first equation, $\partial^2 V/\partial y \partial x = -1$ while, from the
second equation, $\partial^2 V/\partial x \partial y = +1$. Since these mixed partial derivatives of V should
be equal, we have a contradiction. The conclusion is that no such V exists and that
$\mathbf{F}_2$ is not conservative. ■
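The conclusion of Example 6.7 can also be seen from the work done round a closed path. If a field has a potential energy V, equation (6.13) with A = B requires the work done round any closed path to be $V_A - V_A = 0$. The sketch below is illustrative only: the choice of path (the unit circle in the plane z = 0, traversed anticlockwise) and the midpoint-rule discretisation are assumptions made here, and the function names are hypothetical.

```python
import math

def work_round_circle(field, steps=20000):
    """Approximate the work done by `field` once round the unit circle in z = 0."""
    total = 0.0
    for k in range(steps):
        t0 = 2 * math.pi * k / steps
        t1 = 2 * math.pi * (k + 1) / steps
        tm = 0.5 * (t0 + t1)
        # Midpoint rule: field at the midpoint of the arc dotted with the chord dr.
        fx, fy, fz = field(math.cos(tm), math.sin(tm), 0.0)
        dx = math.cos(t1) - math.cos(t0)
        dy = math.sin(t1) - math.sin(t0)
        total += fx * dx + fy * dy   # dz = 0 on this path
    return total

def f1(x, y, z):
    """F_1 = -2x i - 2y j - 2z k."""
    return (-2 * x, -2 * y, -2 * z)

def f2(x, y, z):
    """F_2 = y i - x j."""
    return (y, -x, 0.0)

w1 = work_round_circle(f1)
w2 = work_round_circle(f2)
print(w1, w2)
```

On this path $\mathbf{F}_2 \cdot d\mathbf{r} = -dt$ in terms of the angle t, so the exact value for $\mathbf{F}_2$ is $-2\pi$, which is not zero; the work for $\mathbf{F}_1$ vanishes, as it must for a conservative field.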

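Suppose now that the field F(r) is conservative with potential energy V(r) and let C
be any path connecting the points A and B. Then

$$ W[A \to B; C] = \int_C \mathbf{F}(\mathbf{r}) \cdot d\mathbf{r} = \int_C (-\operatorname{grad} V) \cdot d\mathbf{r} $$
$$ = -\int_C \left( \frac{\partial V}{\partial x} \mathbf{i} + \frac{\partial V}{\partial y} \mathbf{j} + \frac{\partial V}{\partial z} \mathbf{k} \right) \cdot (dx\, \mathbf{i} + dy\, \mathbf{j} + dz\, \mathbf{k}) $$
$$ = -\int_C \left( \frac{\partial V}{\partial x} dx + \frac{\partial V}{\partial y} dy + \frac{\partial V}{\partial z} dz \right) $$
$$ = -\int_C dV = V_A - V_B. $$

Thus, when F is conservative with potential energy V,

$$ \int_C \mathbf{F}(\mathbf{r}) \cdot d\mathbf{r} = V_A - V_B, \tag{6.15} $$

for any path C connecting the points A and B. The energy principle (6.11) can therefore
be written

$$ T_B + V_B = T_A + V_A $$

which is equivalent to the energy conservation formula

$$ T + V = E \tag{6.16} $$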


Our result can be summarised as follows: 


Energy conservation in 3-D motion 

When a particle moves in a conservative force field, the sum of its kinetic and 
potential energies remains constant in the motion. 


The condition that $\mathbf{F}$ be conservative seems restrictive, but most force fields encountered
in mechanics actually are conservative!
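
As a hedged illustration of formula (6.15), the sketch below estimates the work done by the conservative field F = -grad(x^2 + y^2 + z^2) along two different paths joining the same end points. The end points A and B, the path shapes and the helper name `work` are arbitrary choices made for this example.

```python
import math
from typing import Callable, Tuple

Vec = Tuple[float, float, float]


def V(r: Vec) -> float:
    x, y, z = r
    return x * x + y * y + z * z


def F(r: Vec) -> Vec:
    # F = -grad V for the potential above.
    x, y, z = r
    return (-2.0 * x, -2.0 * y, -2.0 * z)


def work(path: Callable[[float], Vec], n: int = 2000) -> float:
    """Midpoint-rule estimate of the line integral of F along path(s), 0 <= s <= 1."""
    total = 0.0
    prev = path(0.0)
    for k in range(1, n + 1):
        cur = path(k / n)
        mid = tuple(0.5 * (a + b) for a, b in zip(prev, cur))
        f = F(mid)
        total += sum(fi * (b - a) for fi, a, b in zip(f, prev, cur))
        prev = cur
    return total


A: Vec = (1.0, 0.0, 0.0)
B: Vec = (0.0, 2.0, 1.0)


def straight(s: float) -> Vec:
    return tuple(a + s * (b - a) for a, b in zip(A, B))


def wiggly(s: float) -> Vec:
    # Same end points as `straight`, but a detour in between.
    x, y, z = straight(s)
    bump = math.sin(math.pi * s)
    return (x + bump, y - 2.0 * bump, z + 3.0 * bump * s)


print(work(straight), work(wiggly), V(A) - V(B))
```

Both estimates agree with V_A - V_B, which is what path independence requires for a conservative field.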

Example 6.8 Finding 3-D potential energies 

(a) Show that the uniform gravity field $\mathbf{F} = -mg\,\mathbf{k}$ is conservative with potential
energy $V = mgz$.

(b) Show that any force field of the form

$$ \mathbf{F} = h(r)\,\hat{\mathbf{r}} $$

(a central field) is conservative with potential energy $V = -H(r)$, where $H(r)$ is the
indefinite integral of $h(r)$. Use this result to find the potential energies of (i) the 3-D
SHM field $\mathbf{F} = -\alpha r\,\hat{\mathbf{r}}$, and (ii) the attractive inverse square field $\mathbf{F} = -(K/r^2)\,\hat{\mathbf{r}}$,
where $\alpha$ and $K$ are positive constants.

Solution 

Since the potential energies are given, it is sufficient to evaluate $-\operatorname{grad} V$ in each case
and show that this gives the appropriate $\mathbf{F}$. Case (a) is immediate. In case (b),

$$ \frac{\partial H(r)}{\partial x} = \frac{dH}{dr}\,\frac{\partial r}{\partial x} = H'(r)\,\frac{x}{r} = h(r)\,\frac{x}{r}, $$

since $r = (x^2 + y^2 + z^2)^{1/2}$ and $H'(r) = h(r)$. Thus

$$ -\operatorname{grad}\,[-H(r)] = h(r) \left( \frac{x}{r}\,\mathbf{i} + \frac{y}{r}\,\mathbf{j} + \frac{z}{r}\,\mathbf{k} \right) = h(r)\,\frac{\mathbf{r}}{r} = h(r)\,\hat{\mathbf{r}}, $$

as required.

In particular then, the potential energy of the SHM field $\mathbf{F} = -\alpha r\,\hat{\mathbf{r}}$ is
$V = \tfrac{1}{2}\alpha r^2$, and the potential energy of the attractive inverse square field
$\mathbf{F} = -(K/r^2)\,\hat{\mathbf{r}}$ is $V = -K/r$. ■
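
The two potential energies just quoted can be spot-checked by differentiating them numerically. This is a minimal sketch, assuming illustrative values ALPHA = 2 and K = 3 and a sample point with |r| = 3; the helper `grad` is a name introduced for this example only.

```python
import math
from typing import Callable, List

ALPHA = 2.0  # assumed SHM constant, for illustration only
K = 3.0      # assumed inverse-square constant, for illustration only


def grad(V: Callable[[List[float]], float], r: List[float], h: float = 1e-6) -> List[float]:
    """Central-difference gradient of the scalar function V at the point r."""
    g = []
    for i in range(3):
        p = list(r)
        q = list(r)
        p[i] += h
        q[i] -= h
        g.append((V(p) - V(q)) / (2.0 * h))
    return g


def V_shm(r: List[float]) -> float:
    return 0.5 * ALPHA * sum(c * c for c in r)


def V_inv(r: List[float]) -> float:
    return -K / math.sqrt(sum(c * c for c in r))


r = [1.0, 2.0, 2.0]                        # |r| = 3
F_shm = [-c for c in grad(V_shm, r)]       # should match -ALPHA * r
F_inv = [-c for c in grad(V_inv, r)]       # should match -K * r / |r|^3
print(F_shm, [-ALPHA * c for c in r])
print(F_inv, [-K * c / 27.0 for c in r])
```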

Example 6.9 Projectile motion 

A body is projected from the ground with speed $u$ and lands on the flat roof of a building
of height $h$. Find the speed with which the projectile lands. [Assume uniform
gravity and no air resistance.]

Solution 

Since uniform gravity is a conservative field with potential energy $mgz$, energy conservation
applies in the form

$$ \tfrac{1}{2} m |\mathbf{v}|^2 + mgz = E, $$

where $O$ is the initial position of the projectile and $Oz$ points vertically upwards.
From the initial conditions, $E = \tfrac{1}{2} m u^2$. Hence, when the body lands,

$$ \tfrac{1}{2} m |\mathbf{v}^L|^2 + mgh = \tfrac{1}{2} m u^2, $$

where $\mathbf{v}^L$ is the landing velocity. The landing speed is therefore

$$ |\mathbf{v}^L| = \left( u^2 - 2gh \right)^{1/2}. $$

Thus, energy conservation determines the speed of the body on landing, but not its 
velocity. ■ 
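
The remark that energy conservation fixes the landing speed but not the landing velocity can be illustrated with ordinary projectile kinematics. The sketch below is an independent check under assumed values (u = 20, h = 5, g = 10 in SI units, and three launch angles); the function name `landing_speed` is hypothetical.

```python
import math


def landing_speed(u: float, angle_deg: float, h: float, g: float = 9.81) -> float:
    """Speed when the projectile comes down through height h, from kinematics alone."""
    th = math.radians(angle_deg)
    ux, uz = u * math.cos(th), u * math.sin(th)
    disc = uz * uz - 2.0 * g * h
    if disc < 0.0:
        raise ValueError("projectile never reaches the roof height")
    t_land = (uz + math.sqrt(disc)) / g   # later root of z(t) = h (descending branch)
    vz = uz - g * t_land                  # vertical velocity on landing
    return math.hypot(ux, vz)             # horizontal velocity is unchanged


u, h, g = 20.0, 5.0, 10.0
for angle in (40.0, 60.0, 80.0):
    print(angle, landing_speed(u, angle, h, g), math.sqrt(u * u - 2.0 * g * h))
```

The landing speed is the same for every launch angle that reaches the roof, while the direction of the landing velocity differs from angle to angle.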


Example 6.10 Escape from the Moon 

A body is projected from the surface of the Moon with speed $u$ in any direction. Show
that the body cannot escape from the Moon if $u^2 < 2MG/R$, where $M$ and $R$ are the
mass and radius of the Moon. [Assume that the Moon is spherically symmetric.]

Solution 

If the Moon is spherically symmetric, then the force $\mathbf{F}$ that it exerts on the body is
given by $\mathbf{F} = -(mMG/r^2)\,\hat{\mathbf{r}}$, where $m$ is the mass of the body, and $\mathbf{r}$ is the position
vector of the body relative to an origin at the centre of the Moon. This force is a
conservative field with potential energy $V = -mMG/r$. Hence energy conservation
applies in the form

$$ \tfrac{1}{2} m |\mathbf{v}|^2 - \frac{mMG}{r} = E, $$

and, from the initial conditions, $E = \tfrac{1}{2} m u^2 - (mMG)/R$. Thus the energy conservation
equation is

$$ |\mathbf{v}|^2 = u^2 + 2MG \left( \frac{1}{r} - \frac{1}{R} \right). $$

Since the left side of the above equation cannot be negative, the values of $r$ that occur in the
motion must satisfy the inequality

$$ u^2 + 2MG \left( \frac{1}{r} - \frac{1}{R} \right) \ge 0. $$

If the body is to escape, this inequality must hold for arbitrarily large $r$. This means
that the condition

$$ u^2 - \frac{2MG}{R} \ge 0 $$

is necessary for escape. Hence if $u^2 < 2MG/R$, the body cannot escape. The interesting
feature here is that the ‘escape speed’ is the same for all directions of projection
from the surface of the Moon. (The special case in which the body is projected vertically
upwards was solved in Chapter 4.) ■
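
To put a number on the threshold, the sketch below evaluates (2MG/R)^{1/2} for the Moon. The numerical constants are approximate reference values supplied for this illustration and do not come from the text; `max_radius_bound` is a hypothetical helper that solves the inequality above for the largest value of r it allows.

```python
import math

G = 6.674e-11      # gravitational constant, N m^2 kg^-2 (approximate)
M_MOON = 7.342e22  # mass of the Moon, kg (approximate, assumed for this example)
R_MOON = 1.7374e6  # mean radius of the Moon, m (approximate, assumed for this example)


def escape_speed(M: float, R: float) -> float:
    """Threshold speed from the condition u^2 >= 2MG/R."""
    return math.sqrt(2.0 * M * G / R)


def max_radius_bound(u: float, M: float, R: float) -> float:
    """Largest r allowed by u^2 + 2MG(1/r - 1/R) >= 0; infinite if escape is possible."""
    inv_r = 1.0 / R - u * u / (2.0 * M * G)
    return math.inf if inv_r <= 0.0 else 1.0 / inv_r


u_esc = escape_speed(M_MOON, R_MOON)
print(f"threshold speed: {u_esc:.0f} m/s")
print(f"bound on r for u = 1500 m/s: {max_radius_bound(1500.0, M_MOON, R_MOON) / 1e3:.0f} km")
```

With these values the threshold comes out at a little under 2.4 km/s. For a launch that is not radial the body need not reach the bound returned by `max_radius_bound`; it is only an upper limit.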

Example 6.11 Stability of equilibrium in a 3-D conservative field 


A particle $P$ of mass $m$ can move under the gravitational attraction of two particles,
of equal mass $M$, fixed at the points $(0, 0, \pm a)$. Show that the origin $O$ is a position
of equilibrium, but that it is not stable. [This illustrates the general result that no
free-space static gravitational field can provide a position of stable equilibrium.]

Solution 

When P is at O, the fixed particles exert equal and opposite forces so that the total 
force on P is zero. The origin is therefore an equilibrium position for P. 

Just as in rectilinear motion, O will be a position of stable equilibrium if the 
potential energy function V (x, y , z) has a minimum at O. This means that the value 
of V at O must be less than its values at all nearby points. But at points on the z-axis 
between z = —a and z = a 


$$ V(0, 0, z) = -\frac{mMG}{a - z} - \frac{mMG}{a + z} = -\frac{2a\,mMG}{a^2 - z^2}, $$

which has a maximum at $z = 0$. Hence the equilibrium at $O$ is unstable to disturbances
in the $z$-direction. ■
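
A quick numerical look at the potential energy near O shows the saddle-like behaviour behind this conclusion. The sketch assumes units in which m = M = G = a = 1 and uses the full two-particle potential, of which the formula above is the restriction to the z-axis; the function name `potential` is an assumption of this example.

```python
import math


def potential(x: float, y: float, z: float, a: float = 1.0,
              m: float = 1.0, M: float = 1.0, G: float = 1.0) -> float:
    """Potential energy of P due to equal masses M fixed at (0, 0, +a) and (0, 0, -a)."""
    d_plus = math.sqrt(x * x + y * y + (z - a) ** 2)
    d_minus = math.sqrt(x * x + y * y + (z + a) ** 2)
    return -m * M * G * (1.0 / d_plus + 1.0 / d_minus)


V0 = potential(0.0, 0.0, 0.0)
print("V at O:           ", V0)
print("V at (0, 0, 0.1): ", potential(0.0, 0.0, 0.1))  # lower than V at O
print("V at (0.1, 0, 0): ", potential(0.1, 0.0, 0.0))  # higher than V at O
```

V decreases along the z-axis and increases along the x-axis, so O is not a minimum of V, in line with the instability found above.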


6.5 ENERGY CONSERVATION IN CONSTRAINED MOTION 

Some of the most useful applications of energy conservation occur when the moving
particle is subject to geometrical constraints, such as being connected to a fixed point
by a light inextensible string, or being required to remain in contact with a fixed rigid
surface (see section 4.2). Since constraint forces are not known beforehand one may wonder
how to find the work that they do. The answer is that, in the idealised problems that
we study, the work done by the constraint forces is often zero. In these cases the constraint
forces make no contribution to the energy principle and they can be disregarded.

FIGURE 6.4 The particle P slides over the fixed smooth surface S. The reaction R is
normal to S and hence perpendicular to v, the velocity of P.

Situations in which constraint forces do no work include:

Some constraint forces that do no work 

• A particle connected to a fixed point by a light inextensible string; here the string
tension does no work.

• A particle sliding along a smooth fixed wire; here the reaction of the wire does 
no work. 

• A particle sliding over a smooth fixed surface; here the reaction of the surface 
does no work. 


Consider for example the case of a particle $P$ sliding over a smooth fixed surface $S$ as
shown in Figure 6.4. Because $S$ is smooth, any reaction force $\mathbf{R}$ that it exerts must always
be normal to $S$. But, since $P$ remains on $S$, its velocity $\mathbf{v}$ must always be tangential to $S$.
Hence $\mathbf{R}$ is always perpendicular to $\mathbf{v}$ so that $\mathbf{R} \cdot \mathbf{v} = 0$. Thus the rate of working of $\mathbf{R}$
is zero and so $\mathbf{R}$ makes no contribution to the energy principle. Very similar arguments
apply to the other two cases.

We may now extend the use of conservation of energy as follows: 


Energy conservation in constrained motion 

When a particle moves in a conservative force field and is subject to constraint
forces that do no work, the sum of its kinetic and potential energies remains constant
in the motion.


Example 6.12 The snowboarder 

A snowboarder starts from rest and descends a slope, losing 320 m of altitude in the
process. What is her speed at the bottom? [Neglect all forms of resistance and take
$g = 10\ \mathrm{m\,s^{-2}}$.]

Solution

The snowboarder moves under uniform gravity and the reaction force of the smooth
hillside. Since this reaction force does no work, energy conservation applies in the
form

$$ \tfrac{1}{2} m |\mathbf{v}|^2 + mgz = E, $$

where $m$ and $\mathbf{v}$ are the mass and velocity of the snowboarder, and $z$ is the altitude of
the snowboarder relative to the bottom of the hill. If the snowboarder starts from rest
at altitude $h$, then $E = 0 + mgh$. Hence, at the bottom of the hill where $z = 0$, her
speed is

$$ |\mathbf{v}| = (2gh)^{1/2}, $$

just as if she had fallen down a vertical hole! This speed evaluates to $80\ \mathrm{m\,s^{-1}}$, about
180 mph. [At such speeds, air resistance would have an important influence.] ■
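
The arithmetic in this example is easy to reproduce. The short sketch below evaluates (2gh)^{1/2} with the values given in the question and converts the result to miles per hour; the conversion factor (1 mile = 1609.344 m) is supplied for this illustration.

```python
import math

g = 10.0   # m s^-2, as specified in the example
h = 320.0  # m of altitude lost

speed = math.sqrt(2.0 * g * h)       # m s^-1
speed_mph = speed * 3600.0 / 1609.344

print(f"{speed:.1f} m/s, about {speed_mph:.0f} mph")
```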

Our next example concerns a particle constrained to move on a vertical circle. This is
one of the classical applications of the energy conservation method. There are two distinct
cases: (i) where the particle is constrained always to remain on the circle, or (ii) where
the particle is constrained to remain on the circle only while the constraint force has a
particular sign.

Example 6.13 Motion in a vertical circle 


A fixed hollow sphere has centre $O$ and a smooth inner surface of radius $b$. A particle
$P$, which is inside the sphere, is projected horizontally with speed $u$ from the lowest
interior point (see Figure 6.5). Show that, in the subsequent motion,

$$ v^2 = u^2 - 2gb(1 - \cos\theta), $$

provided that $P$ remains in contact with the sphere.

Solution 

While $P$ remains in contact with the sphere, the motion is as shown in Figure 6.5.
The forces acting on $P$ are uniform gravity $m\mathbf{g}$ and the constraint force $\mathbf{N}$, which
is the normal reaction of the smooth sphere. Since $\mathbf{N}$ is always perpendicular to $\mathbf{v}$
(the circumferential velocity of $P$), it follows that $\mathbf{N}$ does no work. Hence energy
conservation applies in the form

$$ \tfrac{1}{2} m v^2 - mgb\cos\theta = E, $$

where $m$ is the mass of $P$, and the zero level of the potential energy is the horizontal
plane through $O$. Since $v = u$ when $\theta = 0$, it follows that $E = \tfrac{1}{2} m u^2 - mgb$ and the
energy conservation equation becomes

$$ v^2 = u^2 - 2gb(1 - \cos\theta), \tag{6.17} $$

as required. This gives the value of $v$ as a function of $\theta$ while $P$ remains in contact
with the sphere. ■

FIGURE 6.5 Particle P slides on the smooth inner surface of a fixed sphere. The motion
takes place in a vertical plane through the centre O.

Question The reaction force 

Find the reaction force $N$ as a function of $\theta$.

Answer 

Once the motion is determined (for example by equation (6.17)), the unknown constraint
forces may be found by using the Second Law in reverse. In the present case,
consider the component of the Second Law $\mathbf{F} = m\mathbf{a}$ in the direction $\overrightarrow{PO}$. This gives

$$ N - mg\cos\theta = \frac{mv^2}{b}, $$

where we have made use of the formula (2.17) for the acceleration of a particle in
general circular motion. On using the formula for $v^2$ from equation (6.17), we obtain

$$ N = \frac{mu^2}{b} + mg(3\cos\theta - 2). \tag{6.18} $$

This gives the value of $N$ as a function of $\theta$ while $P$ remains in contact with the
sphere. ■

Question Does P leave the surface of the sphere? 

For the particular case in which $u = (3gb)^{1/2}$, show that $P$ will leave the surface of
the sphere, and find the value of $\theta$ at which it does so.

Answer

When $u = (3gb)^{1/2}$, the formulae (6.17), (6.18) for $v^2$ and $N$ become

$$ v^2 = gb(1 + 2\cos\theta), \qquad N = mg(1 + 3\cos\theta). $$

If $P$ remains in contact with the sphere, then it comes to rest when $v = 0$, that is,
when $\cos\theta = -1/2$. This first happens when $\theta = 120^\circ$ (a point on the upper half of
the sphere, higher than $O$). If $P$ were threaded on a circular wire (from which it could
not fall off) this is exactly what would happen; $P$ would perform periodic oscillations
in the range $-120^\circ \le \theta \le 120^\circ$. However, in the present case, the reaction $N$ is
restricted to be positive and this condition will be violated when $\theta > \cos^{-1}(-1/3) \approx 109^\circ$.
Since this angle is less than $120^\circ$, the conclusion is that $P$ loses contact with
the sphere when $\theta = \cos^{-1}(-1/3)$; at this instant, the speed of $P$ is $(gb/3)^{1/2}$. $P$
then moves as a free projectile until it strikes the sphere. ■

Question Complete circles 

How large must the initial speed be for $P$ to perform complete circles?

Answer

For complete circles to be executed, it is necessary (and sufficient) that $v > 0$ and
$N > 0$ at all times, that is,

$$ u^2 > 2gb(1 - \cos\theta) \quad \text{and} \quad u^2 > gb(2 - 3\cos\theta) $$

for all values of $\theta$. For these inequalities to hold for all $\theta$, the speed $u$ must satisfy
$u^2 > 4gb$ and $u^2 > 5gb$ respectively. Since the second of these conditions implies
the first, it follows that $P$ will execute complete circles if $u^2 > 5gb$. ■
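
The three questions above can be gathered into a small calculation based on formulae (6.17) and (6.18). This is a sketch under stated assumptions: the function names are hypothetical, the values g = 9.81 and b = 0.5 are arbitrary, and the classification by k = u^2/(gb) is read off from those two formulae rather than quoted from the text. For k <= 2 the particle stops at or below the level of O with N >= 0, and for k >= 5 the reaction N does not change sign, so in both cases no leaving angle is returned.

```python
import math
from typing import Optional


def v_squared(theta: float, u: float, b: float, g: float) -> float:
    """Equation (6.17): squared speed while P stays in contact."""
    return u * u - 2.0 * g * b * (1.0 - math.cos(theta))


def reaction(theta: float, u: float, b: float, g: float, m: float = 1.0) -> float:
    """Equation (6.18): normal reaction while P stays in contact."""
    return m * u * u / b + m * g * (3.0 * math.cos(theta) - 2.0)


def leave_angle(u: float, b: float, g: float) -> Optional[float]:
    """Angle (radians) at which N first vanishes, or None if contact is never lost."""
    k = u * u / (g * b)
    if k <= 2.0 or k >= 5.0:
        return None
    return math.acos((2.0 - k) / 3.0)


g, b = 9.81, 0.5
u = math.sqrt(3.0 * g * b)
theta = leave_angle(u, b, g)
print("leaves at", math.degrees(theta), "degrees")
print("v^2 there:", v_squared(theta, u, b, g), "compare gb/3 =", g * b / 3.0)
```

For u^2 = 3gb this reproduces the angle cos^{-1}(-1/3) and the speed (gb/3)^{1/2} obtained in the answer above.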


Example 6.14 Small oscillations in constrained motion 


A particle $P$ of mass $m$ can slide freely along a long straight wire. $P$ is connected
to a fixed point $A$, which is at a distance $4a$ from the wire, by a light elastic cord of
natural length $3a$ and strength $\alpha$. Find the approximate period of small oscillations of
$P$ about its equilibrium position.

Solution 

Suppose $P$ has displacement $x$ from its equilibrium position. In this position, the
length of the cord is $(16a^2 + x^2)^{1/2}$ and its potential energy $V$ is

$$ V = \tfrac{1}{2}\alpha \left[ (16a^2 + x^2)^{1/2} - 3a \right]^2 = \tfrac{1}{2}\alpha \left[ 25a^2 + x^2 - 6a(16a^2 + x^2)^{1/2} \right]. $$

The energy conservation equation for $P$ is therefore

$$ \tfrac{1}{2} m \dot{x}^2 + \tfrac{1}{2}\alpha \left[ 25a^2 + x^2 - 6a(16a^2 + x^2)^{1/2} \right] = E, $$

which, on neglecting powers of $x$ higher than the second, becomes

$$ \tfrac{1}{2} m \dot{x}^2 + \tfrac{1}{2}\alpha \left[ a^2 + \frac{x^2}{4} \right] = E. $$

On differentiating this equation with respect to $t$, we obtain the approximate linearised
equation of motion

$$ m\ddot{x} + \frac{\alpha}{4}\,x = 0. $$

This is the SHM equation with $\omega^2 = \alpha/4m$. It follows that the approximate period
of small oscillations about $x = 0$ is $4\pi(m/\alpha)^{1/2}$. ■
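
The effective stiffness α/4 can be confirmed without any series expansion by differencing the exact potential energy at x = 0. The sketch below assumes the illustrative values a = α = m = 1; the function names are introduced for this example only.

```python
import math


def cord_energy(x: float, a: float, alpha: float) -> float:
    """Exact potential energy of the cord when P is displaced x along the wire."""
    length = math.sqrt(16.0 * a * a + x * x)
    return 0.5 * alpha * (length - 3.0 * a) ** 2


def stiffness_at_equilibrium(a: float, alpha: float, h: float = 1e-3) -> float:
    """Second derivative of the potential energy at x = 0 by central differences."""
    return (cord_energy(h, a, alpha) - 2.0 * cord_energy(0.0, a, alpha)
            + cord_energy(-h, a, alpha)) / (h * h)


a, alpha, m = 1.0, 1.0, 1.0
k_eff = stiffness_at_equilibrium(a, alpha)
period = 2.0 * math.pi * math.sqrt(m / k_eff)
print("effective stiffness:", k_eff, "compare alpha/4 =", alpha / 4.0)
print("period:", period, "compare 4*pi*sqrt(m/alpha) =", 4.0 * math.pi * math.sqrt(m / alpha))
```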


Energy conservation from a physical viewpoint 

Suppose that a particle $P$ of mass $m$ can move on the $x$-axis and is connected to a fixed
post at $x = -a$ by a light elastic spring of natural length $a$ and strength $\alpha$. Then the force
$F(x)$ exerted on $P$ by the spring is given by $F = -\alpha x$, where $x$ is the displacement of $P$
in the positive $x$-direction. This force field has potential energy $V = \tfrac{1}{2}\alpha x^2$ and the energy
conservation equation for $P$ takes the form

$$ \tfrac{1}{2} m v^2 + \tfrac{1}{2}\alpha x^2 = E, \tag{6.19} $$

where $v = \dot{x}$. Here, the spring is regarded merely as an agency that supplies a force field with
potential energy $V = \tfrac{1}{2}\alpha x^2$. However, there is a much more satisfying interpretation of the
energy conservation principle (6.19) that can be made.

To see this we consider the spring as described above, but now with no particle attached
to its free end. Suppose that the spring is in equilibrium (with its free end at $x = 0$) when an
external force $G(t)$ is applied there. This force is initially zero and increases so that, at any
time $t$, the spring has extension $X = G(t)/\alpha$. Suppose that this process continues until the
spring has extension $\Delta$. Then the total work done by the force $G(t)$ in producing this extension
is given by

$$ \int_0^\tau G(t)\,\dot{X}\,dt = \int_0^\tau \alpha X\,\frac{dX}{dt}\,dt = \int_0^\Delta \alpha X\,dX = \tfrac{1}{2}\alpha\Delta^2 . $$

Since the force exerted by the fixed post does no work, the total work done by the external
forces in producing the extension $\Delta$ is $\tfrac{1}{2}\alpha\Delta^2$. Suppose now that the spring is ‘frozen’ in its
extended state (by being propped open, for example) while the particle $P$ is connected to the
free end. The system is then released from rest. The energy conservation equation for $P$ is
given by equation (6.19), where, from the initial condition $v = 0$ when $x = \Delta$, the total energy
$E = \tfrac{1}{2}\alpha\Delta^2$. This gives

$$ \tfrac{1}{2} m v^2 + \tfrac{1}{2}\alpha x^2 = \tfrac{1}{2}\alpha\Delta^2 . \tag{6.20} $$

Thus, the total energy in the subsequent motion is equal to the original work done in stretching
the spring. The natural physical interpretation of this is that the spring is able to store the work
that is done upon it as internal energy. Then, when the particle is connected and the system
released, this stored energy is available to be transferred to the particle in the form of kinetic
energy. Equation (6.20) can thus be interpreted as an energy conservation principle for the
particle and spring together, as follows: In any motion of the particle and spring, the sum
of the kinetic energy of the particle and the internal energy of the spring remains constant. In
this interpretation, the particle has no potential energy; instead, the spring has internal energy.
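
The bookkeeping in equation (6.20) can be watched in a short simulation. This is a sketch only: it integrates the equation of motion for the particle attached to the spring with a velocity-Verlet scheme (a choice made for this example, not a method from the text) and records the sum of the particle's kinetic energy and the spring's stored energy. The parameter values are arbitrary.

```python
from typing import List


def total_energies(m: float, alpha: float, delta: float, dt: float, steps: int) -> List[float]:
    """Release from rest at extension delta; return kinetic + spring energy at each step."""
    x, v = delta, 0.0
    acc = -alpha * x / m
    energies = []
    for _ in range(steps):
        v_half = v + 0.5 * dt * acc   # half-step velocity update
        x += dt * v_half              # full-step position update
        acc = -alpha * x / m          # spring force at the new position
        v = v_half + 0.5 * dt * acc   # complete the velocity update
        energies.append(0.5 * m * v * v + 0.5 * alpha * x * x)
    return energies


m, alpha, delta = 0.2, 5.0, 0.1
stored = 0.5 * alpha * delta ** 2     # work done in stretching the spring
energies = total_energies(m, alpha, delta, dt=1e-3, steps=5000)
print("stored work:", stored)
print("min/max of total energy:", min(energies), max(energies))
```

Up to the small error of the time-stepping scheme, the total stays equal to the work originally done in stretching the spring, while the split between the two forms changes throughout the motion.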

In the above example, the particle and the spring can pass energy to each other, but the
total of the two energies is conserved. This is the essential nature of energy. It is an entity that
can appear in different forms but whose total is always conserved. Energy is probably the
most important notion in the whole of physics. However, it should be remembered that, in the
context of mechanics, it is not usual to take account of forms of energy such as heat or light.
As a result, we will find situations (inelastic collisions, for example) in which energy seems to
disappear. There is no contradiction in this; the energy has simply been transferred into forms
that we choose not to recognise.


Problems on Chapter 6 

Answers and comments are at the end of the book. 
Harder problems carry a star (*). 


Unconstrained motion 

6.1 A particle $P$ of mass 4 kg moves under the action of the force $\mathbf{F} = 4\,\mathbf{i} + 12t^2\,\mathbf{j}$ N, where
$t$ is the time in seconds. The initial velocity of the particle is $2\,\mathbf{i} + \mathbf{j} + 2\,\mathbf{k}\ \mathrm{m\,s^{-1}}$. Find the
work done by $\mathbf{F}$, and the increase in kinetic energy of $P$, during the time interval $0 \le t \le 1$.
What principle does this illustrate?

6.2 In a competition, a man pushes a block of mass 50 kg with constant speed $2\ \mathrm{m\,s^{-1}}$ up a
smooth plane inclined at $30^\circ$ to the horizontal. Find the rate of working of the man. [Take
$g = 10\ \mathrm{m\,s^{-2}}$.]

6.3 An athlete putts a shot of mass 7 kg a distance of 20 m. Show that the athlete must do
at least 700 J of work to achieve this. [Ignore the height of the athlete and take $g = 10\ \mathrm{m\,s^{-2}}$.]

6.4 Find the work needed to lift a satellite of mass 200 kg to a height of 2000 km above the
Earth’s surface. [Take the Earth to be spherically symmetric and of radius 6400 km. Take the
surface value of $g$ to be $9.8\ \mathrm{m\,s^{-2}}$.]


6.5 A particle P of unit mass moves on the positive x-axis under the force field 


F = 



(x > 0). 


Show that each motion of $P$ consists of either (i) a periodic oscillation between two extreme
points, or (ii) an unbounded motion with one extreme point, depending upon the value of the
total energy. Initially $P$ is projected from the point $x = 4$ with speed 0.5. Show that $P$
oscillates between two extreme points and find the period of the motion. [You may make use
of the formula

$$ \int_a^b \frac{x\,dx}{[(x - a)(b - x)]^{1/2}} = \frac{\pi(a + b)}{2}. \ ] $$


Show that there is a single equilibrium position for P and that it is stable. Find the period 
of small oscillations about this point. 


6.6 A particle $P$ of mass $m$ moves on the $x$-axis under the force field with potential energy
$V = V_0 (x/b)^4$, where $V_0$ and $b$ are positive constants. Show that any motion of $P$ consists
of a periodic oscillation with centre at the origin. Show further that, when the oscillation has
amplitude $a$, the period $\tau$ is given by

$$ \tau = 2\sqrt{2} \left( \frac{m}{V_0} \right)^{1/2} \frac{b^2}{a} \int_0^1 \frac{d\xi}{(1 - \xi^4)^{1/2}}. $$

[Thus, the larger the amplitude, the shorter the period!]


6.7 A particle $P$ of mass $m$, which is on the negative $x$-axis, is moving towards the origin
with constant speed $u$. When $P$ reaches the origin, it experiences the force $F = -Kx^2$, where
$K$ is a positive constant. How far does $P$ get along the positive $x$-axis?

6.8 A particle $P$ of mass $m$ moves on the $x$-axis under the combined gravitational attraction
of two particles, each of mass $M$, fixed at the points $(0, \pm a, 0)$ respectively (see Figure 3.3).
Example 3.4 shows that the force field $F(x)$ acting on $P$ is given by

$$ F = -\frac{2mMGx}{(a^2 + x^2)^{3/2}}. $$

Find the corresponding potential energy $V(x)$.

Initially $P$ is released from rest at the point $x = 3a/4$. Find the maximum speed achieved
by $P$ in the subsequent motion.


6.9 A particle $P$ of mass $m$ moves on the axis $Oz$ under the gravitational attraction of a
uniform circular disk of mass $M$ and radius $a$ as shown in Figure 3.6. Example 3.6 shows that
the force field $F(z)$ acting on $P$ is given by

$$ F = -\frac{2mMG}{a^2} \left[ 1 - \frac{z}{(a^2 + z^2)^{1/2}} \right] \qquad (z > 0). $$

Find the corresponding potential energy $V(z)$ for $z > 0$.

Initially $P$ is released from rest at the point $z = 4a/3$. Find the speed of $P$ when it hits
the disk.


6.10 A catapult is made by connecting a light elastic cord of natural length $2a$ and strength $\alpha$
between two fixed supports, which are distance $2a$ apart. A stone of mass $m$ is placed at the
centre of the cord, which is pulled back a distance $3a/4$ and then released from rest. Find the
speed with which the stone is projected by the catapult.

6.11 A light spring of natural length a is placed on a horizontal floor in the upright position. 
When a block of mass M is resting in equilibrium on top of the spring, the compression of the 
spring is a/15. The block is now lifted to a height 3a/2 above the floor and released from rest. 
Find the compression of the spring when the block first comes to rest. 

6.12 A particle $P$ carries a charge $e$ and moves under the influence of the static magnetic field
$\mathbf{B}(\mathbf{r})$ which exerts the force $\mathbf{F} = e\,\mathbf{v} \times \mathbf{B}$ on $P$, where $\mathbf{v}$ is the velocity of $P$. Show that $P$
travels with constant speed.

6.13* A mortar shell is to be fired from level ground so as to clear a flat topped building of
height $h$ and width $a$. The mortar gun can be placed anywhere on the ground and can have
any angle of elevation. What is the least projection speed that will allow the shell to clear the
building? [Hint How is the required minimum projection speed changed if the mortar is raised
to rooftop level?]

For the special case in which $h = \tfrac{1}{2}a$, find the optimum position for the mortar and the
optimum elevation angle to clear the building.

If you are a star at electrostatics, try the following two problems: 

6.14* An earthed conducting sphere of radius $a$ is fixed in space, and a particle $P$, of mass
$m$ and charge $q$, can move freely outside the sphere. Initially $P$ is a distance $b$ ($> a$) from the
centre $O$ of the sphere when it is projected directly away from $O$. What must the projection
speed be for $P$ to escape to infinity? [Ignore electrodynamic effects. Use the method of images
to solve the electrostatic problem.]

6.15* An uncharged conducting sphere of radius $a$ is fixed in space and a particle $P$, of mass
$m$ and charge $q$, can move freely outside the sphere. Initially $P$ is a distance $b$ ($> a$) from the
centre $O$ of the sphere when it is projected directly away from $O$. What must the projection
speed be for $P$ to escape to infinity? [Ignore electrodynamic effects. Use the method of images
to solve the electrostatic problem.]

Constrained motion 

6.16 A bead of mass $m$ can slide on a smooth circular wire of radius $a$, which is fixed in
a vertical plane. The bead is connected to the highest point of the wire by a light spring of
natural length $3a/2$ and strength $\lambda$. Determine the stability of the equilibrium position at the
lowest point of the wire in the cases (i) $\lambda = 2mg/a$, and (ii) $\lambda = 5mg/a$.

6.17 A smooth wire has the form of the helix $x = a\cos\theta$, $y = a\sin\theta$, $z = b\theta$, where $\theta$ is
a real parameter, and $a$, $b$ are positive constants. The wire is fixed with the axis $Oz$ pointing
vertically upwards. A particle $P$, which can slide freely on the wire, is released from rest at the
point $(a, 0, 2\pi b)$. Find the speed of $P$ when it reaches the point $(a, 0, 0)$ and the time taken
for it to do so.

6.18 A smooth wire has the form of the parabola $z = x^2/2b$, $y = 0$, where $b$ is a positive
constant. The wire is fixed with the axis $Oz$ pointing vertically upwards. A particle $P$, which
can slide freely on the wire, is performing oscillations with $x$ in the range $-a \le x \le a$. Show
that the period $\tau$ of these oscillations is given by

$$ \tau = \frac{4}{(gb)^{1/2}} \int_0^a \left( \frac{b^2 + x^2}{a^2 - x^2} \right)^{1/2} dx. $$

By making the substitution $x = a\sin\psi$ in the above integral, obtain a new formula for $\tau$. Use
this formula to find a two-term approximation to $\tau$, valid when the ratio $a/b$ is small.

6.19* A smooth wire has the form of the cycloid $x = c(\theta + \sin\theta)$, $y = 0$, $z = c(1 - \cos\theta)$,
where $c$ is a positive constant and the parameter $\theta$ lies in the range $-\pi \le \theta \le \pi$. The wire
is fixed with the axis $Oz$ pointing vertically upwards. [Make a sketch of the wire.] A particle
can slide freely on the wire. Show that the energy conservation equation is

$$ (1 + \cos\theta)\,\dot{\theta}^2 + \frac{g}{c}\,(1 - \cos\theta) = \text{constant}. $$

A new parameter $u$ is defined by $u = \sin\tfrac{1}{2}\theta$. Show that, in terms of $u$, the equation of motion
for the particle is

$$ \ddot{u} + \left( \frac{g}{4c} \right) u = 0. $$

Deduce that the particle performs oscillations with period $4\pi(c/g)^{1/2}$, independent of the
amplitude!

FIGURE 6.6 The swing of the pendulum is obstructed by a fixed peg.

6.20 A smooth horizontal table has a vertical post fixed to it which has the form of a circular
cylinder of radius $a$. A light inextensible string is wound around the base of the post (so that
it does not slip) and the free end of the string is attached to a particle that can slide on the
table. Initially the unwound part of the string is taut and of length $4a/3$. The particle is then
projected horizontally at right angles to the string so that the string winds itself on to the post.
How long does it take for the particle to hit the post? [You may make use of the formula

$$ \int (1 + \phi^2)^{1/2}\,d\phi = \tfrac{1}{2}\phi\,(1 + \phi^2)^{1/2} + \tfrac{1}{2}\sinh^{-1}\phi. \ ] $$

6.21 A heavy ball is suspended from a fixed point by a light inextensible string of length
$b$. The ball is at rest in the equilibrium position when it is projected horizontally with speed
$(7gb/2)^{1/2}$. Find the angle that the string makes with the upward vertical when the ball begins
to leave its circular path. Show that, in the subsequent projectile motion, the ball returns to its
starting point.

6.22* A new avant garde mathematics building has a highly polished outer surface in the
shape of a huge hemisphere of radius 40 m. The Head of Department, Prof. Oldfart, has his
student, Vita Youngblood, hauled to the summit (to be photographed for publicity purposes)
but a small gust of wind causes Vita to begin to slide down. Oldfart’s displeasure is increased
when Vita lands on (and severely damages) his car which is parked nearby. How far from the
outer edge of the building did Oldfart park his car? Did he get what he deserved? (Happily,
Vita escaped injury and found a new supervisor.)

6.23** A heavy ball is attached to a fixed point $O$ by a light inextensible string of length $2a$.
The ball is drawn back until the string makes an acute angle $\alpha$ with the downward vertical and
is then released from rest. A thin peg is fixed a distance $a$ vertically below $O$ in the path of
the string, as shown in Figure 6.6. In a game of skill, the contestant chooses the value of $\alpha$ and
wins a prize if the ball strikes the peg. Show that the winning value of $\alpha$ is approximately $86^\circ$.

Orbits in a central field

including Rutherford scattering


KEY FEATURES 

For motion in general central force fields, the key results are the radial motion equation and
the path equation. For motion in the inverse square force field, the key formulae are the
E-formula, the L-formula and the period formula.


The theory of orbits has a special place in classical mechanics for it was the desire
to understand why the planets move as they do which provided the major stimulus in
the development of mechanics as a scientific discipline. Early in the seventeenth century,
Johannes Kepler* published his ‘laws of planetary motion’, which he deduced by
analysing the accurate experimental observations made by the astronomer Tycho Brahe.†


* The German mathematician and astronomer Johannes Kepler (1571-1630) was a firm believer in the
Copernican (heliocentric) model of the solar system. In 1596 he became mathematical assistant to Tycho
Brahe, the foremost observational astronomer of the day, and began working on the intractable problem
of the orbit of Mars. This work continued after Tycho’s death in 1601 and, after much labour, Kepler
showed that Tycho’s observations of Mars corresponded very precisely to an elliptic orbit with the Sun at
a focus. This result, together with the ‘law of areas’ (the second law) was published in 1609. Kepler then
found similar orbits for other planets and his third law was published in 1619.
† Tycho Brahe (1546-1601) was a Danish nobleman. He had a lifelong interest in observational astronomy
and developed a succession of new and more accurate instruments. The King of Denmark gave him
money to create an observatory and also the island of Hven on which to build it. It was here that Tycho
made his accurate observations of the planets from which Kepler was able to deduce his laws of planetary
motion. Tycho’s other claim to fame is that he had a metal nose. When the original was cut off in a duel,
he had an artificial nose made from an alloy of silver and gold. Tycho is perhaps better remembered for
his nose job than he is for a lifetime of observations.

FIGURE 7.1 Each planet P moves on an elliptical path with the Sun S at one focus.
The area A is that referred to in Kepler’s second law.


Kepler’s laws of planetary motion 

First law Each of the planets moves on an elliptical path with the Sun at one focus 
of the ellipse. 

Second law For each of the planets, the straight line connecting the planet to the 
Sun sweeps out equal areas in equal times. 

Third law The squares of the periods of the planets are proportional to the cubes of 
the major axes of their orbits. 


The problem of determining the law of force that causes the motions described by
Kepler (and proving that it does so) was the most important scientific problem of the seventeenth
century. In what must be the finest achievement in the whole history of science,
Newton’s publication of Principia in 1687 not only proved that the inverse square law of
gravitation implies Kepler’s laws, but also laid down the entire framework of the science
of mechanics. Orbit theory is just as important today, the principal fields of application
being astronomy, particle scattering and space travel.

In this chapter, we treat the problem of a particle moving in a central force field with a
fixed centre; this is called the one-body problem. The assumption that the centre of force
is fixed is an accurate approximation in the context of planetary orbits. The combined
mass of all the planets, moons and asteroids is less than 0.2% of the mass of the Sun. We
therefore expect the motion of the Sun to be comparatively small, as are inter-planetary
influences.* However, we do not confine our interest to motion under the attractive inverse
square field. At first, we consider motion in any central force field with a fixed centre. This
part of the theory will then apply not only to gravitating bodies, but also (for example) to
the scattering of neutrons. The important cases of inverse square attraction and repulsion
are then examined in greater detail.


* The more general two-body problem is treated in Chapter 10. The two-body theory must be used to
analyse problems in which the masses of the two interacting bodies are comparable, as they are in binary
stars.

FIGURE 7.2 Each orbit of a particle P in a central force field with centre O
takes place in a plane through O. The position of P in the plane of motion is
specified by polar coordinates r, θ with centre at O.


7.1 THE ONE-BODY PROBLEM - NEWTON’S EQUATIONS 

First we define what we mean by a central force field. 

Definition 7.1 Central field A force field $\mathbf{F}(\mathbf{r})$ is said to be a central field with centre
$O$ if it has the form

$$ \mathbf{F}(\mathbf{r}) = F(r)\,\hat{\mathbf{r}}, $$

where $r = |\mathbf{r}|$ and $\hat{\mathbf{r}} = \mathbf{r}/r$. A central field is thus spherically symmetric about its centre.

A good example of a central force is the gravitational force exerted by a fixed point
mass. Suppose $P$ has mass $m$ and moves under the gravitational attraction of a point mass
$M$ fixed at the origin. In this case, the force acting on $P$ is given by the law of gravitation
to be

$$ \mathbf{F}(\mathbf{r}) = -\frac{mMG}{r^2}\,\hat{\mathbf{r}}, $$

where $G$ is the constant of gravitation. This is a central field with

$$ F(r) = -\frac{mMG}{r^2}. $$


Each orbit lies in a plane through the centre of force 

The first thing to observe is that, when a particle P moves in a central field with centre
O, each orbit of P takes place in a plane through O, as shown in Figure 7.2. This is the
plane that contains O and the initial position and velocity of P. One may give a vectorial
proof of this, but it is quite clear on symmetry grounds that P will never leave this plane.
Each motion is therefore two-dimensional and we take polar coordinates $r$, $\theta$ (centred on
O) to specify the position of P in the plane of motion. On using the formulae (2.14) for
the components of acceleration in polar coordinates, the Newton equations of motion for
P become

$$m\left(\ddot r - r\dot\theta^2\right) = F(r), \tag{7.1}$$

$$m\left(r\ddot\theta + 2\dot r\dot\theta\right) = 0. \tag{7.2}$$

FIGURE 7.3 The angular momentum $mr^2\dot\theta = mpv$, where $v = |\mathbf{v}|$.

Angular momentum conservation

Equation (7.2) can be written in the form

$$\frac{1}{r}\frac{d}{dt}\left(mr^2\dot\theta\right) = 0,$$

which can be integrated with respect to t to give

$$mr^2\dot\theta = \text{constant}.$$

The quantity $mr^2\dot\theta$, which is a constant of the motion, is called the angular momentum*
of P. The general theory of angular momentum (and its conservation) is described in
Chapter 11, but for now it is sufficient to regard 'angular momentum' simply as a name that
we give to the conserved quantity $mr^2\dot\theta$. This angular momentum has a simple kinematical
interpretation. From Figure 7.3 it follows that

$$mr^2\dot\theta = mr(r\dot\theta) = mrv_\theta = m\,(r\cos\alpha)\left(\frac{v_\theta}{\cos\alpha}\right) = mpv,$$

where p is the perpendicular distance of O from the tangent to the path of P, and $v = |\mathbf{v}|$.
This formula provides the usual way of calculating the constant value of the angular
momentum from the initial conditions.


* More precisely, it is the angular momentum of the particle about the axis $\{O, \mathbf{k}\}$, where the unit vector $\mathbf{k}$
is perpendicular to the plane of motion (see Figure 7.2). The angular momentum of P about the point O
is the vector quantity $m\,\mathbf{r}\times\mathbf{v}$, but the axial angular momentum used in the present chapter is the component
of this vector in the $\mathbf{k}$-direction.
Newton equations in specific form

It is usual and convenient to eliminate the mass m from the theory. If we write

$$F(r) = mf(r),$$

where $f(r)$ is the outward force per unit mass, and let $L$ ($= r^2\dot\theta$) be the angular momentum
per unit mass then the Newton equations (7.1), (7.2) reduce to the specific form

$$\ddot r - r\dot\theta^2 = f(r), \tag{7.3}$$

$$r^2\dot\theta = L, \tag{7.4}$$

where L is a constant.* Note that these equations apply to orbits in any central field. The 
second of these equations appears throughout this chapter and we will call it the angular 
momentum equation. 


Angular momentum equation

$$r^2\dot\theta = L \tag{7.5}$$


Kepler’s second law 

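Angular momentum conservation is equivalent to Kepler's second law. The area A shown
in Figure 7.1 can be expressed (with an obvious choice of initial line) as

$$A = \frac{1}{2}\int_0^{\theta} r^2\,d\theta.$$

Then, by the chain rule,

$$\frac{dA}{dt} = \frac{dA}{d\theta}\times\frac{d\theta}{dt} = \frac{1}{2}r^2\dot\theta = \frac{1}{2}L,$$
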
where L is the constant value of the angular momentum. Thus A increases at a constant 
rate, which is what Kepler’s second law says. Thus Kepler’s second law holds for all 
central force fields, not just the inverse square law. 
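
The claim that the areal rate is constant in every central field can be checked numerically. The sketch below is only an illustration: it integrates the Newton equations in Cartesian form with a classical Runge-Kutta step and monitors $dA/dt = \frac{1}{2}r^2\dot\theta$ along the computed orbit. The function names, the step size and the test field $f(r) = -\sqrt{r}$ are assumptions chosen for the example, not part of the text.

```python
import math

def rk4_step(state, dt, f):
    """Advance (x, y, vx, vy) by one Runge-Kutta step.

    f(r) is the outward force per unit mass of the central field."""
    def deriv(s):
        x, y, vx, vy = s
        r = math.hypot(x, y)
        return (vx, vy, f(r) * x / r, f(r) * y / r)

    def shifted(s, k, h):
        return tuple(si + h * ki for si, ki in zip(s, k))

    k1 = deriv(state)
    k2 = deriv(shifted(state, k1, dt / 2))
    k3 = deriv(shifted(state, k2, dt / 2))
    k4 = deriv(shifted(state, k3, dt))
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def areal_rate(state):
    """dA/dt = (1/2) r^2 theta_dot, which is (x*vy - y*vx)/2 in Cartesian form."""
    x, y, vx, vy = state
    return 0.5 * (x * vy - y * vx)

def max_areal_drift(f, state, dt=1e-3, steps=5000):
    """Largest departure of dA/dt from its initial value along the orbit."""
    initial = areal_rate(state)
    worst = 0.0
    for _ in range(steps):
        state = rk4_step(state, dt, f)
        worst = max(worst, abs(areal_rate(state) - initial))
    return worst

# An attractive field that is NOT inverse square (illustrative choice).
print(max_areal_drift(lambda r: -math.sqrt(r), (1.0, 0.0, 0.0, 0.8)))
```

The printed drift should be at the level of the integration error only, whatever central law is supplied for `f`.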


7.2 GENERAL NATURE OF ORBITAL MOTION 

In our first method of solution, we take as our starting point the principles of 
conservation of angular momentum and energy. 


* Without losing generality, we will take L to be positive, that is, we suppose $\theta$ is increasing with time.
(The special case in which $L = 0$ corresponds to rectilinear motion through O.)
Energy conservation

Every central field $\mathbf{F} = mf(r)\,\hat{\mathbf{r}}$ is conservative with potential energy $mV(r)$, where

$$f(r) = -\frac{dV}{dr}. \tag{7.6}$$

Energy conservation then implies that 


T + V = E, 

where T is the specific kinetic energy, V is the specific potential energy, and the constant 
E is the specific total energy. On replacing T by its expression in polar coordinates, we 
obtain 


Energy equation

$$\frac{1}{2}\left(\dot r^2 + (r\dot\theta)^2\right) + V(r) = E \tag{7.7}$$

as the energy conservation equation. The conservation equations (7.5), (7.7) are equivalent
to the Newton equations (7.1), (7.2) and are a convenient starting point for investigating
the general nature of orbital motion.

The radial motion equation 

From the angular momentum conservation equation (7.5), we have

$$\dot\theta = L/r^2$$

and, on eliminating $\dot\theta$ from the energy conservation equation (7.7), we obtain

$$\frac{1}{2}\dot r^2 + V(r) + \frac{L^2}{2r^2} = E, \tag{7.8}$$

an ODE for the radial distance r(t). We call this the radial motion equation for the 
particle P. Equation (7.8) (together with the initial conditions) is sufficient to determine 
the variation of r with t, and the angular momentum equation (7.5) then determines the 
variation of $\theta$ with t. Unfortunately, for most laws of force, this procedure cannot be
carried through analytically. However, it is still possible to make important deductions 
about the general nature of the motion. 

Equation (7.8) can be written in the form

$$\frac{1}{2}\dot r^2 + V^*(r) = E, \tag{7.9}$$

where

$$V^*(r) = V(r) + \frac{L^2}{2r^2}. \tag{7.10}$$
FIGURE 7.4 The effective potential $V^*$ shown admits bounded and
unbounded orbits, depending on the initial conditions.

The function $V^*(r)$ is called the effective potential of the radial motion and its use
reduces the radial motion of P to a rectilinear problem. It must be emphasised
though that the whole motion is two-dimensional since $\theta$ is increasing in accordance
with (7.5).

Because r satisfies the radial motion equation (7.9), the variation of r with t can be 
analysed by using the same methods as were used in Chapter 6 for rectilinear particle 
motion. In particular, the general nature of the motion depends on the shape of the graph 
of V* (which depends on L) and the value of E. The values of the constants L and E 
depend on the initial conditions. 

Suppose for example that the law of force and the initial conditions are such that $V^*$
has the form shown in Figure 7.4 and that E has the value shown. Then, since $\frac{1}{2}\dot r^2 \ge 0$ in (7.9), it
follows that the motion is restricted to those values r that satisfy the inequality

$$V^*(r) \le E,$$

with equality holding when $\dot r = 0$. There are two possible motions, in each of which the
variation of r with t is governed by the radial motion equation (7.8).

(i) a bounded motion in which r oscillates in the range $[a, b]$. In this motion, $r(t)$ is
a periodic function.*

(ii) an unbounded motion in which r lies in the interval $[c, \infty)$. In this motion r is
not periodic but decreases until the minimum value $r = c$ is achieved and then
increases without limit.
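
The classification above can be explored numerically. The following is a minimal sketch, assuming a user-supplied specific potential `V(r)`: it builds $V^*(r)$ from (7.10) and locates the turning points of the radial motion (the roots of $V^*(r) = E$) by scanning for sign changes and bisecting. The function names, the scan range and the sample values $\gamma = 1$, $L = 1$, $E = -0.3$ are illustrative assumptions.

```python
def effective_potential(V, L):
    """Return V*(r) = V(r) + L^2 / (2 r^2), as in equation (7.10)."""
    return lambda r: V(r) + L**2 / (2 * r**2)

def turning_points(V, L, E, r_min, r_max, samples=2000, tol=1e-12):
    """Roots of V*(r) = E in [r_min, r_max], found by a sign scan plus bisection."""
    v_star = effective_potential(V, L)

    def g(r):
        return v_star(r) - E

    roots = []
    step = (r_max - r_min) / samples
    lo = r_min
    for i in range(samples):
        hi = r_min + (i + 1) * step
        if g(lo) * g(hi) < 0:          # a root is bracketed by [lo, hi]
            a, b = lo, hi
            while b - a > tol:
                mid = 0.5 * (a + b)
                if g(a) * g(mid) <= 0:
                    b = mid
                else:
                    a = mid
            roots.append(0.5 * (a + b))
        lo = hi
    return roots

# Attractive inverse square field V = -1/r with L = 1 and E = -0.3 (bounded case).
print(turning_points(lambda r: -1.0 / r, 1.0, -0.3, 0.1, 10.0))
```

Two roots indicate a bounded motion confined between them; a single root with $V^* < E$ beyond it indicates an unbounded motion of type (ii).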

The bounded orbit. A typical bounded orbit is shown in Figure 7.5 (left). The orbit
alternately touches the inner and outer circles $r = a$ and $r = b$, which corresponds to
the radial coordinate r oscillating in the interval $[a, b]$. Without losing generality, suppose
that P is at the point $B_1$ when $t = 0$ and that $OB_1$ is the line $\theta = 0$. Consider the part of


* The fact that r(t) is periodic does not mean that the whole motion must be periodic. 
FIGURE 7.5 Typical bounded and unbounded orbits.


the orbit between $A_1$ and $A_2$. It follows from the governing equations (7.8), (7.5) that r is
an even function of t while $\theta$ is an odd function of t. This means that the segment $B_1A_2$ of
the orbit is just the reflection of the segment $A_1B_1$ in the line $OB_1$. This argument can be
repeated to show that the segment $A_2B_2$ is the reflection of the segment $B_1A_2$ in the line
$OA_2$, and so on. Thus the whole orbit can be constructed from a knowledge of a single
segment such as $A_1B_1$.

It follows from what has been said that the angles $A_1OB_1$, $B_1OA_2$, $A_2OB_2$ (and
so on) are all equal. Let $\alpha$ be the common value of these angles. Then the orbit will
eventually close itself if some integer multiple of $\alpha$ is equal to some whole number of
complete revolutions, that is, if $\alpha/\pi$ is a rational number. There is no reason to expect
this condition to hold and, in general, it does not. It follows that these bounded orbits are
not generally closed. The closed orbits associated with the attractive inverse square field
are therefore exceptional, rather than typical!

The unbounded orbit. In the unbounded case there are just two segments both of which 
are semi-infinite (see Figure 7.5 (right)). The segment in which P recedes from O is the 
reflection of the segment in which P approaches O in the line OC. 

Apses and apsidal distances 

The points at which an orbit touches its bounding circles are important and are given a 
special name: 

Definition 7.2 Apse, apsidal distance, apsidal angle A point of an orbit at which
the distance OP achieves its maximum or minimum value is called an apse of the orbit.
These maximum and minimum distances are called the apsidal distances and the angular
displacement between successive apses (the angle $\alpha$ in Figure 7.5 (left)) is called the
apsidal angle.
FIGURE 7.6 Left: The effective potential $V^*$ for the attractive inverse square force. Right: The
path of the asteroid around the Sun. C is the point of closest approach.

In the special case of orbits around the Sun, the point of closest approach is called the 
perihelion and the point of maximum distance the aphelion. The corresponding terms for 
orbits around the Earth are perigee and apogee. 

The apsidal distances, the maximum and minimum distances of P from O, are easily
found from the radial motion equation (7.8). At an apse, $\dot r = 0$ and so r must satisfy

$$V(r) + \frac{L^2}{2r^2} = E. \tag{7.11}$$

The positive roots of this equation are the apsidal distances.


Example 7.1 Asteroid deflected by the Sun

A particle P of mass m moves in the central force field $-(m\gamma/r^2)\,\hat{\mathbf{r}}$, where $\gamma$ is a
positive constant. Show that bounded and unbounded orbits are possible depending
on the value of E.

An asteroid is approaching the Sun from a great distance. At this time it has
constant speed u and is moving in a straight line whose perpendicular distance from
the Sun is p. Find the equation satisfied by the apsidal distances of the subsequent
orbit. For the special case in which $u^2 = 4M_\odot G/3p$ (where $M_\odot$ is the mass of the
Sun), find (i) the distance of closest approach of the asteroid to the Sun, and (ii) the
speed of the asteroid at the time of closest approach.

Solution 

For this law of force, $V = -\gamma/r$ and the effective potential $V^*$ is

$$V^*(r) = -\frac{\gamma}{r} + \frac{L^2}{2r^2}.$$

This $V^*$ has the form shown in Figure 7.6 (left), from which it is clear that the orbit
will be

(i) bounded if $E < 0$,

(ii) unbounded if $E \ge 0$,

whatever the value of L.
In the asteroid example, the constant $\gamma = M_\odot G$, where $M_\odot$ is the mass of the
Sun and G is the constant of gravitation. With the given initial conditions, $L = pu$
and $E = u^2/2$, so that $E > 0$ and the orbit is unbounded.

The equation (7.11) for the apsidal distances becomes

$$-\frac{\gamma}{r} + \frac{p^2u^2}{2r^2} = \frac{u^2}{2},$$

that is,

$$u^2r^2 + 2\gamma r - p^2u^2 = 0,$$

where $\gamma = M_\odot G$.

For the special case in which $u^2 = 4M_\odot G/3p$, this equation simplifies to

$$2r^2 + 3pr - 2p^2 = 0.$$

The distance of closest approach of the asteroid is the positive root of this quadratic
equation, namely $r = p/2$.

The speed V of the asteroid at closest approach is easily deduced from angular
momentum conservation. Initially, $L = pu$ and, at closest approach, $L = (p/2)V$. It
follows that $V = 2u$. ■
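
As a quick numerical cross-check of Example 7.1, the sketch below solves the apsidal quadratic $u^2r^2 + 2\gamma r - p^2u^2 = 0$ for its positive root and then applies angular momentum conservation. The function names and the sample values $\gamma = 3$, $p = 2$ are illustrative assumptions; any positive values would do.

```python
import math

def closest_approach(gamma, p, u):
    """Positive root of u^2 r^2 + 2*gamma*r - p^2 u^2 = 0."""
    return (-gamma + math.sqrt(gamma**2 + (p * u**2) ** 2)) / u**2

def speed_at_closest_approach(gamma, p, u):
    """Speed V at the apse from angular momentum conservation, p*u = r_min*V."""
    return p * u / closest_approach(gamma, p, u)

# Special case of the example: u^2 = 4*gamma / (3*p).
gamma, p = 3.0, 2.0
u = math.sqrt(4 * gamma / (3 * p))
print(closest_approach(gamma, p, u) / p)           # ratio r_min / p
print(speed_at_closest_approach(gamma, p, u) / u)  # ratio V / u
```

For the special case the two printed ratios should agree with the values $1/2$ and $2$ found in the example.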


7.3 THE PATH EQUATION 

In principle, the method of the last section allows us to determine the complete 
motion of the orbiting body as a function of the time. However, the procedure is usually 
too difficult to be carried through analytically. We can make the problem easier (and 
make more progress) by seeking just the equation of the path taken by the body, and not 
enquiring where the body is on this path at any particular time. 

We start from the Newton equation (7.3) and try to eliminate the time by using the
angular momentum equation (7.5). In doing this it is helpful to introduce the new dependent
variable u, given by

$$u = 1/r. \tag{7.12}$$

This transformation has a magically simplifying effect. We begin by transforming $\dot r$ and
$\ddot r$. By the chain rule,

$$\dot r = \frac{d}{dt}\left(\frac{1}{u}\right) = -\frac{1}{u^2}\times\frac{du}{d\theta}\times\frac{d\theta}{dt},$$

which, on using the angular momentum equation (7.5) in the form $\dot\theta = Lu^2$, gives

$$\dot r = -L\frac{du}{d\theta}. \tag{7.13}$$
A second differentiation with respect to t then gives

$$\ddot r = -L\frac{d}{dt}\left(\frac{du}{d\theta}\right) = -L\frac{d^2u}{d\theta^2}\times\frac{d\theta}{dt} = -L^2u^2\frac{d^2u}{d\theta^2}, \tag{7.14}$$

on using the angular momentum equation again.

The term $r\dot\theta^2 = L^2u^3$ so that the Newton equation (7.3) is transformed into

$$-L^2u^2\frac{d^2u}{d\theta^2} - L^2u^3 = f(1/u),$$

that is,

$$\frac{d^2u}{d\theta^2} + u = -\frac{f(1/u)}{L^2u^2}. \tag{7.15}$$

This is the path equation. Its solutions are the polar equations of the paths that the body
can take when it moves under the force field $\mathbf{F} = mf(r)\,\hat{\mathbf{r}}$.

Despite the appearance of the left side of equation (7.15), the path equation is not 
linear in general. This is because the right side is a function of u, the dependent variable. 
Only for the inverse square and inverse cube laws does the path equation become linear. 
It is a remarkable piece of good luck that the inverse square law (the most important case 
by far) is one of only two cases that can be solved easily. 

Initial conditions for the path equation 

Suitable initial conditions for the path equation are provided by specifying the values of u
and $du/d\theta$ when $\theta = \alpha$, say. Since $u = 1/r$, the initial value of u is given directly by the
initial data. The value of $du/d\theta$ is not given directly but can be deduced from equation
(7.13) in the form

$$\frac{du}{d\theta} = -\frac{\dot r}{L}, \tag{7.16}$$

where $\dot r$ and L are obtained from the initial data.


Example 7.2 Path equation for the inverse cube law 

The engines of the starship Enterprise have failed and the ship is moving in a straight
line with speed V. The crew calculate that their present course will miss the planet
B-Zar by a distance p. However, B-Zar is known to exert the force

$$\mathbf{F} = -\frac{m\gamma}{r^3}\,\hat{\mathbf{r}}$$

on any mass m in its vicinity. A measurement of the constant $\gamma$ reveals that

$$\gamma = \frac{8p^2V^2}{9}.$$

Show that the crew of the Enterprise will get a free tour around B-Zar before continuing
along their original path. What is the distance of closest approach and what is
the speed of the Enterprise at that instant?

Solution 

For the given law of force, $f(r) = -\gamma/r^3$ so that $f(1/u) = -\gamma u^3$. Also, from the
initial conditions, $L = pV$. The path equation is therefore

$$\frac{d^2u}{d\theta^2} + u = \frac{\gamma u^3}{p^2V^2u^2},$$

which simplifies to

$$\frac{d^2u}{d\theta^2} + \frac{u}{9} = 0,$$

on using the stated value of $\gamma$. The general solution of this equation is

$$u = A\cos(\theta/3) + B\sin(\theta/3).$$

The constants A and B can now be determined from the initial conditions. Take the
initial line $\theta = 0$ as shown in Figure 7.7. Then:

(i) The initial condition $r = \infty$ when $\theta = 0$ implies that $u = 0$ when $\theta = 0$. It
follows that $A = 0$.

(ii) The initial condition on $du/d\theta$ is given by (7.16) to be

$$\frac{du}{d\theta} = -\frac{\dot r}{L} = -\frac{(-V)}{pV} = \frac{1}{p}$$

when $\theta = 0$. It follows that $B = 3/p$.

The required solution is therefore

$$u = \frac{3}{p}\sin(\theta/3),$$

that is

$$r = \frac{p}{3\sin(\theta/3)}.$$

This is the polar equation of the path of the Enterprise, as shown in Figure 7.7. The
Enterprise recedes to infinity when $\sin(\theta/3) = 0$ again, that is when $\theta = 3\pi$. Thus
the Enterprise makes one circuit of B-Zar before continuing on as before.

The distance of closest approach is $p/3$ and is achieved when $\theta = 3\pi/2$. By
angular momentum conservation, the speed of the Enterprise at that instant is $3V$. ■
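
The path equation (7.15) is also easy to integrate numerically, which gives an independent check on Example 7.2. The sketch below is an illustration under stated assumptions: the right side of (7.15) is supplied as a function of u (here $\gamma u/L^2$ for the inverse cube law), the integrator is a classical Runge-Kutta scheme in $\theta$, and the function name, step count and the values $p = V = 1$ are chosen only for the example.

```python
import math

def solve_path(force_term, u0, du0, theta_end, steps=3000):
    """Integrate u'' + u = force_term(u) from theta = 0 to theta_end by RK4.

    force_term(u) stands for -f(1/u) / (L^2 u^2), the right side of (7.15).
    Returns the pair (u, du/dtheta) at theta_end."""
    h = theta_end / steps
    u, w = u0, du0

    def acc(x):
        return force_term(x) - x

    for _ in range(steps):
        k1u, k1w = w, acc(u)
        k2u, k2w = w + 0.5 * h * k1w, acc(u + 0.5 * h * k1u)
        k3u, k3w = w + 0.5 * h * k2w, acc(u + 0.5 * h * k2u)
        k4u, k4w = w + h * k3w, acc(u + h * k3u)
        u += h * (k1u + 2 * k2u + 2 * k3u + k4u) / 6
        w += h * (k1w + 2 * k2w + 2 * k3w + k4w) / 6
    return u, w

# Inverse cube law of Example 7.2 with p = V = 1, so L = 1 and gamma = 8/9.
p, V = 1.0, 1.0
L = p * V
gamma = 8 * p**2 * V**2 / 9
u_apse, slope = solve_path(lambda u: gamma * u / L**2, 0.0, 1.0 / p, 1.5 * math.pi)
print(1.0 / u_apse)   # distance from B-Zar at theta = 3*pi/2
```

The computed distance at $\theta = 3\pi/2$ should match the closest approach $p/3$ found analytically, and the slope $du/d\theta$ should vanish there.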


7.4 NEARLY CIRCULAR ORBITS 


Although the path equation cannot be solved exactly for most laws of force, it is 
possible to obtain approximate solutions when the body is slightly perturbed from a known 
orbit. In particular, this can always be done when the unperturbed orbit is a circle with 
centre O. 

Suppose that a particle P moves in a circular orbit of radius a under the attractive
force $f(r)$ per unit mass. This is only possible if its speed v satisfies $v^2/a = f(a)$, in
which case its angular momentum L is given by $L^2 = a^3f(a)$. Suppose that P is now
slightly disturbed by a small radial impulse. The angular momentum is unchanged but P
now moves along some new path

$$u = \frac{1}{a}\left(1 + \xi(\theta)\right),$$

where $\xi$ is a small perturbation. In terms of $\xi$, the path equation becomes

$$\frac{d^2\xi}{d\theta^2} + 1 + \xi = +\frac{(1+\xi)^{-2}}{f(a)}\,f\!\left(\frac{a}{1+\xi}\right).$$

This exact equation for $\xi$ is non-linear, but we will now approximate it by expanding the
right side in powers of $\xi$. On expanding the function $f(r)$ in a Taylor series about $r = a$
we obtain

$$f\!\left(\frac{a}{1+\xi}\right) = f(a) - \left(\frac{a\xi}{1+\xi}\right)f'(a) + O\!\left(\left(\frac{\xi}{1+\xi}\right)^{2}\right) = f(a) - af'(a)\,\xi + O\!\left(\xi^2\right),$$

and a simple binomial expansion gives

$$(1+\xi)^{-2} = 1 - 2\xi + O\!\left(\xi^2\right).$$

On combining these results together, the constant terms cancel and we obtain

$$\frac{d^2\xi}{d\theta^2} + \left(3 + \frac{af'(a)}{f(a)}\right)\xi = 0, \tag{7.17}$$

on neglecting terms of order $O(\xi^2)$. This is the approximate linearised equation satisfied
by the perturbation $\xi(\theta)$.

The general behaviour of the solutions of equation (7.17) depends on the sign of the
coefficient of $\xi$.


(i) If

$$3 + \frac{af'(a)}{f(a)} < 0, \tag{7.18}$$

then the solutions are linear combinations of real exponentials, one of which has
a positive exponent. In this case, the solution for $\xi$ will not remain small, contrary
to assumption. The conclusion is that the original circular orbit is unstable.

(ii) Alternatively, if

$$\Omega^2 = 3 + \frac{af'(a)}{f(a)} > 0, \tag{7.19}$$

then the solutions are linear combinations of real cosines and sines, which remain
bounded. The conclusion is that the original circular orbit is stable (at least to
small radial impulses).


Closure of the perturbed orbits 

From now on we will assume that the stability condition (7.19) is satisfied. The general
solution of equation (7.17) then has the form

$$\xi = A\cos\Omega\theta + B\sin\Omega\theta.$$

We see that the perturbed orbit will close itself after one revolution if $\Omega$ is a positive
integer. When the law of force is the power law

$$f(r) = kr^{\nu},$$

we have $\Omega^2 = 3 + \nu$, so the perturbed orbit is stable for $\nu > -3$ and will close itself after one revolution if

$$\nu = m^2 - 3,$$

where m is a positive integer. The case $m = 1$ corresponds to inverse square attraction
and $m = 2$ corresponds to simple harmonic attraction. The exponents $\nu = 6, 13, \ldots$
are also predicted to give closed orbits. It should be remembered though that these are
only the predictions of the approximate linearised theory.* It is possible (but not pretty)
to improve on the linear approximation by including quadratic terms in $\xi$ as well as linear
ones. The result of this refined theory is that the powers $\nu = -2$ and $\nu = 1$ still give


* It makes no sense to say that an orbit approximately closes itself! 
closed orbits, but the powers $\nu = 6, 13, \ldots$ do not. This shows that the power laws with
$\nu = 6, 13, \ldots$ do not give perturbed orbits that close after one revolution, but the cases
$\nu = -2$ and $\nu = 1$ are still not finally decided. Mercifully, there is no need to carry the
approximation procedure any further because all the paths corresponding to both inverse 
square and simple harmonic attraction can be calculated exactly. It is found that, for 
these two laws of force, all bounded orbits close after one revolution.* There remains the 
possibility that the perturbed orbits might close themselves after more than one revolution, 
but a similar analysis shows that this does not happen. We have therefore shown that the 
only power laws for which all bounded orbits are closed are the simple harmonic and 
inverse square laws. This result is actually true for all central fields (not just power laws) 
and is known as Bertrand’s theorem. 
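
The linearised stability test is simple to evaluate for any law of force. The sketch below computes $\Omega^2 = 3 + af'(a)/f(a)$ from (7.19), estimating $f'(a)$ by a central difference, and applies it to a few power laws $f(r) = kr^{\nu}$. The function names and the finite-difference step are illustrative assumptions; for a power law the result can be compared with the exact value $3 + \nu$.

```python
def omega_squared(f, a, h=1e-6):
    """Omega^2 = 3 + a f'(a) / f(a), with f'(a) from a central difference."""
    dfda = (f(a * (1 + h)) - f(a * (1 - h))) / (2 * a * h)
    return 3 + a * dfda / f(a)

def is_stable(f, a):
    """Linearised stability of the circular orbit of radius a, condition (7.19)."""
    return omega_squared(f, a) > 0

def power_law(nu, k=1.0):
    """Attractive power law f(r) = k r^nu (magnitude of the force per unit mass)."""
    return lambda r: k * r**nu

for nu in (-4, -2, 1, 6):
    f = power_law(nu)
    print(nu, omega_squared(f, 1.5), is_stable(f, 1.5))
```

Only the ratio $af'(a)/f(a)$ enters, so the constant k and the sign convention for f do not affect the result.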


Precession of the perihelion of Mercury 

The fact that the inverse square law leads to closed orbits, whilst very similar laws do not,
provides an extremely sensitive test of the law of gravitation. Suppose for instance that
the attractive force experienced by a planet were

$$f(r) = \frac{\gamma}{r^{2+\epsilon}}$$

(per unit mass), where $\gamma > 0$ and $|\epsilon|$ is small. Then the value of $\Omega$ for a nearly circular
orbit is

$$\Omega = (1 - \epsilon)^{1/2} = 1 - \tfrac{1}{2}\epsilon + O(\epsilon^2).$$

This perturbed orbit does not close but has apsidal angle $\alpha$, where

$$\alpha = \frac{\pi}{\Omega} = \frac{\pi}{1 - \tfrac{1}{2}\epsilon + O(\epsilon^2)} = \pi\left(1 + \tfrac{1}{2}\epsilon\right) + O(\epsilon^2).$$

Hence successive perihelions of the planet will not occur at the same point, but the perihelion
will advance 'annually' by the small angle $\pi\epsilon$. The position of the perihelion
of a planet can be measured with great accuracy. For the planet Mercury it is found
(after all known perturbations have been subtracted out) that the perihelion advances by
43 (±0.5) seconds of arc per century, or $5 \times 10^{-7}$ radians per revolution. This corresponds
to $\epsilon = 1.6 \times 10^{-7}$ and a power of −2.00000016 instead of −2. Minuscule though this
discrepancy from the inverse square law seems, it is considerably greater than the error in
the observations and for a considerable time was something of a puzzle.
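
The conversion from 43 seconds of arc per century to radians per revolution, and then to $\epsilon$, is a short piece of arithmetic that can be sketched as follows. The orbital period of Mercury (taken here as 87.969 days) and the length of the year (365.25 days) are assumptions supplied for the illustration; they are not quoted in the text.

```python
import math

ARCSEC_IN_RADIANS = math.pi / (180 * 3600)

def advance_per_revolution(arcsec_per_century, period_days, year_days=365.25):
    """Perihelion advance in radians per revolution of the planet."""
    revolutions_per_century = 100 * year_days / period_days
    return arcsec_per_century * ARCSEC_IN_RADIANS / revolutions_per_century

# Assumed orbital period of Mercury: 87.969 days (not given in the text).
advance = advance_per_revolution(43.0, 87.969)
epsilon = advance / math.pi        # the advance per revolution is pi * epsilon
print(advance, epsilon)
```

The two printed values can be compared with the figures $5 \times 10^{-7}$ and $1.6 \times 10^{-7}$ quoted above.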

This puzzle was resolved in a striking fashion by the theory of general relativity, 
published by Einstein in 1915. Einstein showed that one consequence of his theory was 
that planetary orbits should precess slightly and that, in the case of Mercury, the rate of 
precession should be 43 seconds of arc per century! 


* In the inverse square case, the bounded orbits are ellipses with a focus at O and, in the simple harmonic
case, they are ellipses with the centre at O.
7.5 THE ATTRACTIVE INVERSE SQUARE FIELD


Because of its many applications to astronomy, the attractive inverse square field 
is the most important force field in the theory of orbits. The same field occurs in particle 
scattering when the two particles carry unlike electric charges. Because of these important 
applications, we will treat the inverse square field in more detail than other fields. In particular,
we will obtain formulae that enable inverse square problems to be solved quickly
and easily without referring to the equations of motion at all. 

The paths 

Suppose that $f(r) = -\gamma/r^2$ where $\gamma > 0$. Then $f(1/u) = -\gamma u^2$ and the path equation
becomes

$$\frac{d^2u}{d\theta^2} + u = \frac{\gamma}{L^2},$$

where L is the angular momentum of the orbit. This has the form of the SHM equation
with a constant on the right. The general solution is

$$u = A\cos\theta + B\sin\theta + \frac{\gamma}{L^2},$$

which can be written in the form

$$\frac{1}{r} = \frac{\gamma}{L^2}\left(1 + e\cos(\theta - \alpha)\right), \tag{7.20}$$

where $e$, $\alpha$ are constants with $e \ge 0$. This is the polar equation of a conic of eccentricity
$e$ and with one focus at O; $\alpha$ is the angle between the major axis of the conic and the initial
line $\theta = 0$. If $e < 1$, then the conic is an ellipse; if $e = 1$ then the conic is a parabola;
and when $e > 1$ the conic is the near branch of a hyperbola. The necessary geometry
of the ellipse and hyperbola is summarised in Appendix A at the end of the chapter; the
special case of the parabolic orbit is of marginal interest and we will make little mention
of it.

Kepler’s first law 

It follows from the above that the only bounded orbits in the attractive inverse square field 
are ellipses with one focus at the centre of force. This is Kepler’s first law, which is 
therefore a consequence of inverse square law attraction by the Sun. It would not be true 
for other laws of force. 

The L-formula and the E-formula 

By comparing the path formula (7.20) with the standard polar forms given in Appendix
A, we see that the angular momentum L of the orbit is related to the conic parameters a,
b by the formula

$$\frac{\gamma}{L^2} = \frac{a}{b^2},$$

that is,

The L-formula

$$L^2 = \gamma b^2/a \tag{7.21}$$

We will call this result the L-formula. It applies to both elliptic and hyperbolic orbits.
It is the first of two important formulae that relate L, E, the dynamical constants of the
motion, to the conic parameters of the resulting orbit.

The second such formula involves the energy E. At the point of closest approach
$r = c$,

$$E = \frac{1}{2}V^2 - \frac{\gamma}{c},$$

where V is the speed of P when $r = c$. Since P is moving transversely at the point of
closest approach, it follows that $cV = L$, so that E may be written

$$E = \frac{L^2}{2c^2} - \frac{\gamma}{c} = \frac{\gamma b^2}{2ac^2} - \frac{\gamma}{c},$$

on using the L-formula.

From this point on, the different types of conic must be treated separately. When the
orbit is an ellipse, $c = a(1 - e)$, where e is the eccentricity, and a, b and e are related by
the formula

$$b^2 = a^2\left(1 - e^2\right).$$

Then E can be written

$$E = \frac{\gamma a^2(1 - e^2)}{2a^3(1 - e)^2} - \frac{\gamma}{a(1 - e)} = -\frac{\gamma}{2a}.$$

Thus the total energy E in the orbit is directly connected to a, the semi-major axis of the
elliptical orbit. The parabolic and hyperbolic orbits are treated similarly and the full result,
which we will call the E-formula, is
The E-formula

Ellipse:    $E < 0$    $E = -\dfrac{\gamma}{2a}$
Parabola:   $E = 0$
Hyperbola:  $E > 0$    $E = +\dfrac{\gamma}{2a}$        (7.22)

FIGURE 7.8 The asteroid A moves on a hyperbolic orbit
around the Sun S as a focus and is deflected through the
angle $\beta$.

Note that the type of orbit is determined solely by the sign of the total energy E. It
follows that the escape condition (the condition that the body should eventually go
off to infinity) is simply that $E \ge 0$. This useful result is true only for the inverse square
law.
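
Taken together, the L- and E-formulae turn the dynamical constants into the geometry of the orbit. The sketch below shows one way to package this, under stated assumptions: the function name is hypothetical, and the relation $b^2 = a^2(e^2 - 1)$ for the hyperbola is the standard one and is assumed here to agree with Appendix A.

```python
import math

def conic_from_constants(gamma, L, E):
    """Return (kind, a, b, e) for the attractive field f(r) = -gamma / r^2.

    L and E are the specific angular momentum and specific total energy."""
    if E == 0:
        return ("parabola", None, None, 1.0)
    a = gamma / (2 * abs(E))            # E-formula (7.22)
    b = math.sqrt(L**2 * a / gamma)     # L-formula (7.21)
    if E < 0:
        # Ellipse: b^2 = a^2 (1 - e^2); max() guards against rounding error.
        e = math.sqrt(max(0.0, 1 - (b / a) ** 2))
        return ("ellipse", a, b, e)
    # Hyperbola: b^2 = a^2 (e^2 - 1), assumed standard relation.
    e = math.sqrt(1 + (b / a) ** 2)
    return ("hyperbola", a, b, e)

# Asteroid of Example 7.1 with gamma = 3, p = 2, u^2 = 4*gamma/(3*p) = 2.
p, u = 2.0, math.sqrt(2.0)
kind, a, b, e = conic_from_constants(3.0, p * u, u**2 / 2)
print(kind, a, b, e, a * (e - 1))   # a*(e - 1) is the closest approach distance
```

For the asteroid of Example 7.1 the closest approach distance $a(e - 1)$ obtained this way should agree with the value $p/2$ found from the apsidal equation.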


Example 7.3 Asteroid deflected by the Sun 

An asteroid approaches the Sun with speed V along a line whose perpendicular distance
from the Sun is p. Find the angle through which the asteroid is deflected by the
Sun.


Solution 

In this case we have the attractive inverse square field with $\gamma = M_\odot G$, where $M_\odot$
is the mass of the Sun. This problem can be solved from first principles by using the
path equation, but here we make short work of it by using the L- and E-formulae.

From the initial conditions, $L = pV$ and $E = \frac{1}{2}V^2$. Since $E > 0$, the orbit is the
near branch of a hyperbola and the L- and E-formulae give

$$p^2V^2 = \frac{M_\odot G\,b^2}{a}$$

and

$$\frac{1}{2}V^2 = \frac{M_\odot G}{2a}.$$

It follows that

$$a = \frac{M_\odot G}{V^2}, \qquad b = p.$$

The semi-angle $\alpha$ between the asymptotes of the hyperbola is then given (see
Appendix A) by

$$\tan\alpha = \frac{b}{a} = \frac{pV^2}{M_\odot G}.$$

Let $\beta$ be the angle through which the asteroid is deflected. Then (see Figure 7.8)
$\beta = \pi - 2\alpha$ and

$$\tan(\beta/2) = \tan(\pi/2 - \alpha) = \cot\alpha = \frac{M_\odot G}{pV^2}.$$
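
The deflection formula lends itself to a one-line function. The sketch below evaluates $\beta$ from $\tan(\beta/2) = \gamma/(pV^2)$ and, separately, rebuilds it from the semi-axes $a$, $b$ as a consistency check. The function names and the sample numbers are illustrative assumptions, with $\gamma$ standing for $M_\odot G$ in any consistent system of units.

```python
import math

def deflection_angle(gamma, p, V):
    """Deflection angle beta (radians) from tan(beta/2) = gamma / (p V^2)."""
    return 2 * math.atan(gamma / (p * V**2))

def hyperbola_semi_axes(gamma, p, V):
    """(a, b) from the L- and E-formulae with L = p V and E = V^2 / 2."""
    return gamma / V**2, p

# Consistency check: beta = pi - 2*alpha with tan(alpha) = b / a.
a, b = hyperbola_semi_axes(2.0, 3.0, 1.0)
alpha = math.atan(b / a)
print(math.pi - 2 * alpha, deflection_angle(2.0, 3.0, 1.0))
```

A large impact parameter p or a high approach speed V gives a small deflection, while $\gamma = pV^2$ gives a right-angle turn.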


Period of the elliptic orbit 

Whatever the law of force, once the path of P has been found, the progress of P along
that path can be deduced from the angular momentum equation

$$r^2\dot\theta = L.$$

If we take $\theta = 0$ when $t = 0$, then the time t taken for P to progress to the point of the
orbit with polar coordinates $r$, $\theta$ is given by

$$t = \frac{1}{L}\int_0^{\theta} r^2\,d\theta, \tag{7.23}$$

where $r = r(\theta)$ is the equation of the path. In particular then, the period $\tau$ of the elliptic
orbit is given by

$$\tau = \frac{1}{L}\int_0^{2\pi} r^2\,d\theta,$$

where the path $r = r(\theta)$ is given by

$$\frac{1}{r} = \frac{a}{b^2}\left(1 + e\cos\theta\right). \tag{7.24}$$

Fortunately there is no need to evaluate the above integral since, for any path that closes
itself after one circuit,

$$\frac{1}{2}\int_0^{2\pi} r^2\,d\theta = A,$$

where A is the area enclosed by the path. For the elliptical path, $A = \pi ab$ so that

$$\tau = \frac{2\pi ab}{L},$$

and on using the L-formula, the period of the elliptic orbit is given by:

The period formula

$$\tau^2 = \frac{4\pi^2a^3}{\gamma} \tag{7.25}$$


Kepler’s third law 

In the case of the planetary orbits, $\gamma = M_\odot G$, where $M_\odot$ is the mass of the Sun. Equation
(7.25) can then be written

$$\tau^2 = \left(\frac{4\pi^2}{M_\odot G}\right)a^3. \tag{7.26}$$

This is Kepler's third law, which is therefore a consequence of inverse square law attraction
by the Sun and would not be true for other laws of force.

Masses of celestial bodies 

Once the constant of gravitation G is known, the formula (7.26) provides an accurate
way to find the mass of the Sun. The same method applies to any celestial body that has
a satellite. All that is needed is to measure the major axis $2a$ and the period $\tau$ of the
satellite's orbit.*

Question Finding the mass of Jupiter 

The Moon moves in a nearly circular orbit of radius 384,000 km and period 27.32 
days. Callisto, the fourth moon of the planet Jupiter, moves in a nearly circular orbit 
of radius 1,883,000 km and period 16.69 days. Estimate the mass of Jupiter as a 
multiple of the mass of the Earth. 

Answer 

$M_J = 316\,M_E$.
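
The estimate follows from applying formula (7.26) to each planet-satellite pair, so that the mass of the central body is proportional to $a^3/\tau^2$ and the constant G cancels in the ratio. A minimal sketch of the arithmetic, with a hypothetical function name, is:

```python
def mass_ratio(a1, tau1, a2, tau2):
    """M1 / M2 from Kepler's third law, where M is proportional to a^3 / tau^2.

    a1, tau1 describe a satellite of body 1; a2, tau2 a satellite of body 2.
    Lengths and times need only be in the same units for both satellites."""
    return (a1 / a2) ** 3 * (tau2 / tau1) ** 2

# Callisto about Jupiter versus the Moon about the Earth (km and days).
print(mass_ratio(1_883_000, 16.69, 384_000, 27.32))
```

As the footnote points out, this neglects the motion of the central body, so the ratio is only an estimate.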

* It should be noted that here we are neglecting the motion of the centre of force. We will see later that,
when this is taken into account, formula (7.26) actually gives the sum of the masses of the body and
its satellite. Usually, the satellite has a much smaller mass than the body and its contribution can be
disregarded.

Astronomical units

For astronomical problems, it is useful to write the period equation (7.26) in astronomical
units. In these units, the unit of mass is the mass of the Sun ($M_\odot$), the unit of length (the
AU) is the semi-major axis of the Earth's orbit, and the unit of time is the (Earth) year. On
substituting the data for the Earth and Sun into equation (7.26), we find that $G = 4\pi^2$ in
astronomical units. Hence, in astronomical units the period formula becomes

$$\tau^2 = a^3,$$

since $M_\odot = 1$ in these units.

FIGURE 7.9 The eccentric angle $\psi$ corresponding to the polar
angle $\theta$.

Question The major axis of the orbit of Pluto 

The period of Pluto is 248 years. What is the semi-major axis of its orbit? 

Answer 

39.5 AU. ■ 
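
In astronomical units the calculation reduces to taking a two-thirds power. The sketch below inverts (7.26) with $G = 4\pi^2$; the function name is hypothetical, and the optional mass argument (in solar masses) is an assumption added so that the same function could serve for other central bodies.

```python
def semi_major_axis_au(period_years, central_mass_solar=1.0):
    """Semi-major axis in AU from (7.26) with G = 4*pi^2: a^3 = M * tau^2."""
    return (central_mass_solar * period_years**2) ** (1.0 / 3.0)

print(semi_major_axis_au(248.0))   # Pluto, period 248 years
```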

Time dependence of the motion - Kepler’s equation 

The formula (7.23) can be used to find how long it takes for P to progress to a general point
of the orbit. However, although the integration with respect to $\theta$ can be done in closed
form, the result is a very complicated expression. In order to obtain a manageable formula, we
make a cunning change of variable, replacing the polar angle $\theta$ by the eccentric angle $\psi$.
The relationship between these two angles is shown in Figure 7.9. Since $CN = CF + FN$,
it follows that

$$a\cos\psi = ae + r\cos\theta,$$

and, on using the polar equation for the ellipse (7.24) together with the formula $b^2 =
a^2(1 - e^2)$, the relation between $\psi$ and $\theta$ can be written in the symmetrical form

$$(1 - e\cos\psi)(1 + e\cos\theta) = \frac{b^2}{a^2}. \tag{7.27}$$

Implicit differentiation of equation (7.27) with respect to $\psi$ then gives

$$\frac{d\theta}{d\psi} = \frac{b}{a(1 - e\cos\psi)}, \tag{7.28}$$

after more manipulation.

We can now make the change of variable from $\theta$ to $\psi$. From (7.23) and (7.24)

$$t = \frac{b^4}{a^2L}\int_0^{\theta}\frac{d\theta}{(1 + e\cos\theta)^2}
  = \frac{b^4}{a^2L}\int_0^{\psi}\frac{1}{(1 + e\cos\theta)^2}\left(\frac{d\theta}{d\psi}\right)d\psi
  = \frac{ab}{L}\int_0^{\psi}(1 - e\cos\psi)\,d\psi
  = \frac{ab}{L}\left(\psi - e\sin\psi\right),$$

on using (7.27), (7.28). Finally, on making use of the L-formula $L^2 = \gamma b^2/a$, we obtain

Kepler's equation

$$t = \frac{\tau}{2\pi}\left(\psi - e\sin\psi\right) \tag{7.29}$$

where $\tau$ (given by (7.25)) is the period of the orbit. This is Kepler's equation which gives
the time as a function of position on the elliptical orbit.

If one needs to calculate the position of the orbiting body after a given time, then
equation (7.29) must be solved numerically for the eccentric angle $\psi$. The corresponding
value of $\theta$ is then given by equation (7.27) and the r value by equation (7.24) which, in
view of (7.27), can be written in the form

$$r = a(1 - e\cos\psi). \tag{7.30}$$

The need to solve Kepler's equation for the unknown $\psi$ was a major stimulus in the
development of approximate numerical methods for finding roots of equations.
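
One common way to solve Kepler's equation is Newton's method, sketched below as an illustration rather than as the only possible scheme. The function names, the starting guess and the tolerance are assumptions. The polar angle is recovered from the pair $\cos\theta = (\cos\psi - e)/(1 - e\cos\psi)$, $\sin\theta = \sqrt{1 - e^2}\,\sin\psi/(1 - e\cos\psi)$, which is equivalent to (7.27), and r from (7.30). Times are assumed to lie in $0 \le t \le \tau$.

```python
import math

def solve_kepler(t, tau, e, tol=1e-12, max_iter=50):
    """Solve t = (tau / 2 pi) (psi - e sin psi) for psi by Newton's method."""
    M = 2 * math.pi * t / tau
    psi = M if e < 0.8 else math.pi      # conventional safe starting guess
    for _ in range(max_iter):
        step = (psi - e * math.sin(psi) - M) / (1 - e * math.cos(psi))
        psi -= step
        if abs(step) < tol:
            break
    return psi

def position(t, tau, e, a):
    """Polar coordinates (r, theta) on the ellipse at time t after perihelion."""
    psi = solve_kepler(t, tau, e)
    r = a * (1 - e * math.cos(psi))                       # equation (7.30)
    theta = math.atan2(math.sqrt(1 - e**2) * math.sin(psi),
                       math.cos(psi) - e)                 # equivalent to (7.27)
    return r, theta

# A quarter of a period after perihelion on an orbit with e = 0.5, a = 1.
print(position(0.25, 1.0, 0.5, 1.0))
```

The angle returned by `atan2` lies in $(-\pi, \pi]$, so points on the second half of the orbit appear with negative $\theta$.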

Example 7.4 Kepler’s equation 

A body moving in an inverse square attractive field traverses an elliptical orbit with
eccentricity e and period $\tau$. Find the time taken for the body to traverse the half of
the orbit that is nearer the centre of force.

Solution

The half of the orbit nearer the centre of force corresponds to the range $-\pi/2 \le \psi \le
\pi/2$. The time taken is therefore

$$\frac{\tau}{2\pi}\left(\frac{\pi}{2} - e\right) - \frac{\tau}{2\pi}\left(-\frac{\pi}{2} + e\right) = \tau\left(\frac{1}{2} - \frac{e}{\pi}\right).$$

For example, Halley's comet moves on an elliptic orbit whose eccentricity is almost
unity. It therefore spends only about 18% of its time on the half of its orbit that is
nearer the Sun.
FIGURE 7.10 Two planets move on the circular
orbits A and B. A spacecraft is required to 
depart from one planet and rendezvous with the 
other planet at some point of its orbit. The 
Hohmann orbit H achieves this with the least 
expenditure of fuel. 

7.6 SPACE TRAVEL - HOHMANN TRANSFER ORBITS 

An important problem in space travel, and one that nicely illustrates the preceding 
theory, is that of transferring a spacecraft from one planet to another (from Earth to Jupiter 
say). In order to simplify the analysis, we will assume that both of the planetary orbits are 
circular. We will also suppose that the spacecraft has already effectively been removed 
from Earth’s gravity, but is still in the vicinity of the Earth and is orbiting the Sun on the 
same orbit as the Earth. The object is to use the rocket motors to transfer the spacecraft 
to the vicinity of Jupiter, orbiting the Sun on the same orbit as Jupiter. Like everything 
else on board a spacecraft, fuel has to be transported from Earth at huge cost, so the 
transfer from Earth to Jupiter must be achieved using the least mass of fuel. In our analysis 
we will neglect the time during which the rocket engines are firing so that the engines 
are regarded as delivering an impulse to the spacecraft, resulting in a sudden change of 
velocity. After the initial firing impulse, the spacecraft is assumed to move freely under 
the Sun’s gravitation until it reaches the orbit of Jupiter, when a second firing impulse is 
required to circularise the orbit. This is called a two-impulse transfer. 

If the two firings produce velocity changes of $\Delta v^A$ and $\Delta v^B$ respectively, then the
quantity $Q$ that must be minimised if the least fuel is to be used is

\[ Q = |\Delta v^A| + |\Delta v^B|. \]

The orbit that connects the two planetary orbits and minimises Q is called the Hohmann 
transfer orbit* and is shown in Figure 7.10. It has its perihelion at the lift-off point L and 
its aphelion at the rendezvous point R. It is not at all obvious that this is the optimal orbit; 
a proof is given in Appendix B at the end of the chapter. However, it is quite easy to find 
its properties. 

Since the perihelial and aphelial distances in the Hohmann orbit are A and B (the radii 
of the orbits of Earth and Jupiter), it follows that 

\[ A = a(1 - e), \qquad B = a(1 + e), \]



* After Walter Hohmann, the German space research pioneer. 
so that the geometrical parameters of the orbit are given by

\[ a = \tfrac{1}{2}(B + A), \qquad e = \frac{B - A}{B + A}. \]

The angular momentum $L$ of the orbit is then given by the L-formula to be

\[ L^2 = \frac{\gamma b^2}{a} = \gamma a(1 - e^2) = \frac{2\gamma BA}{B + A}, \]

where $\gamma = M_\odot G$.

From $L$ we can find the speed $V^L$ of the spacecraft just after the lift-off firing, and
the speed $V^R$ at the rendezvous point just before the second firing. These are

\[ V^L = \left(\frac{2\gamma B}{A(B + A)}\right)^{1/2}, \qquad V^R = \left(\frac{2\gamma A}{B(B + A)}\right)^{1/2}. \]

The travel time $T$, which is half the period of the Hohmann orbit, is given by

\[ T^2 = \frac{\pi^2 a^3}{\gamma} = \frac{\pi^2 (B + A)^3}{8\gamma}. \]

Finally, in order to rendezvous with Jupiter, the lift-off must take place when Earth
and Jupiter have the correct relative positions, so that Jupiter arrives at the meeting point
at the right time. Since the speed of Jupiter is $(\gamma/B)^{1/2}$ and the travel time is now known,
the angle $\psi$ in Figure 7.10 must be

\[ \psi = \pi\left(\frac{B + A}{2B}\right)^{3/2}. \]


Numerical results for the Earth-Jupiter transfer 

In astronomical units, $G = 4\pi^2$, $A = 1$ AU and, for Jupiter, $B = 5.2$ AU. A speed of 1 AU
per year is 4.74 km per second. Simple calculations then give:

(i) The travel time is 2.73 years, or 997 days.

(ii) $V^L$ is 8.14 AU per year, which is 38.6 km per second. This is the speed the spacecraft
must have after the lift-off firing.

(iii) $V^R$ is 1.56 AU per year, which is 7.4 km per second. This is the speed with which the
spacecraft arrives at Jupiter before the second firing.

(iv) The angle $\psi$ at lift-off must be 83°.

The speeds $V^L$ and $V^R$ should be compared with the speeds of Earth and Jupiter in their
orbits. These are 29.8 km/sec and 13.1 km/sec respectively. Thus the first firing must boost the
speed of the spacecraft from 29.8 to 38.6 km/sec, and the second firing must boost the speed
from 7.4 to 13.1 km/sec. The sum of these speed increments, 14.5 km/sec, is greater than
the speed increment needed (12.4 km/sec) to escape from the Earth’s orbit to infinity. Thus
it takes more fuel to transfer a spacecraft from Earth’s orbit to Jupiter’s orbit than it does to
escape from the solar system altogether!
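
The Earth-Jupiter figures above can be reproduced with a few lines of code. The following is a minimal sketch, assuming astronomical units with $\gamma = M_\odot G = 4\pi^2$ and the conversion 1 AU per year = 4.74 km per second quoted in the text; the function and dictionary key names are invented for the illustration.

```python
import math

GAMMA_SUN = 4.0 * math.pi ** 2   # gamma = M_sun * G in AU^3 / yr^2
KM_S_PER_AU_YR = 4.74            # 1 AU per year in km per second (from the text)


def hohmann_transfer(A, B, gamma=GAMMA_SUN):
    """Hohmann transfer between circular orbits of radii A < B (AU, years)."""
    v_lift = math.sqrt(2.0 * gamma * B / (A * (B + A)))    # V^L, just after lift-off
    v_arrive = math.sqrt(2.0 * gamma * A / (B * (B + A)))  # V^R, just before 2nd firing
    travel_time = math.pi * math.sqrt((B + A) ** 3 / (8.0 * gamma))
    psi = math.pi * ((B + A) / (2.0 * B)) ** 1.5           # lift-off angle, radians
    dv_first = v_lift - math.sqrt(gamma / A)               # boost at the inner orbit
    dv_second = math.sqrt(gamma / B) - v_arrive            # boost at the outer orbit
    return {
        "travel_time_yr": travel_time,
        "v_lift_km_s": v_lift * KM_S_PER_AU_YR,
        "v_arrive_km_s": v_arrive * KM_S_PER_AU_YR,
        "psi_deg": math.degrees(psi),
        "total_dv_km_s": (dv_first + dv_second) * KM_S_PER_AU_YR,
    }


for name, value in hohmann_transfer(A=1.0, B=5.2).items():
    print(f"{name}: {value:.2f}")
```

The printed values should agree with items (i) to (iv) to within rounding; the total speed increment comes out slightly below 14.5 km/sec because the text adds increments that have already been rounded.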
7.7 THE REPULSIVE INVERSE SQUARE FIELD

The force field with $f(r) = +\gamma/r^2$, $(\gamma > 0)$, is the repulsive inverse square
field. It occurs in the interaction of charged particles carrying like charges and is required 
for the analysis of Rutherford scattering. Below we summarise the important properties 
of orbits in a repulsive inverse square field. These results are obtained in exactly the same 
way as for the attractive case. 

The paths 

The path equation is

\[ \frac{d^2 u}{d\theta^2} + u = -\frac{\gamma}{L^2}, \]

where $L$ is the angular momentum of the orbit. Its general solution can be written in the
form

\[ \frac{1}{r} = \frac{\gamma}{L^2}\left[-1 + e\cos(\theta - \alpha)\right], \]

where $e$, $\alpha$ are constants with $e > 0$. By comparing this path with the standard polar
forms of conics given in Appendix A, we see that the path can only be the far branch of a
hyperbola with focus at the centre O.

The L- and E-formulae 

The formulae relating $L$, $E$, the dynamical constants of the orbit, to the hyperbola parameters are

\[ L^2 = \gamma b^2/a, \tag{7.31} \]

\[ E = +\gamma/2a. \tag{7.32} \]


7.8 RUTHERFORD SCATTERING 

The most celebrated application of orbits in a repulsive inverse square field is 
Rutherford’s* famous experiment in which a beam of alpha particles was scattered by 
gold nuclei in a sheet of gold leaf. We will analyse Rutherford’s experiment in detail, 
beginning with the basic problem of a single alpha particle being deflected by a single 
fixed gold nucleus. 

Alpha particle deflected by a heavy nucleus 

An alpha particle A of mass m and charge q approaches a gold nucleus B of charge Q 
(see Figure 7.11). B is initially at rest and A is moving with speed V along a line whose 


* Ernest Rutherford (1871-1937), a New Zealander, was one of the greatest physicists of the twentieth 
century. His landmark work on the structure of the nucleus in 1911 (and with Geiger and Marsden in 
1913) was conducted at the University of Manchester, England. 
FIGURE 7.11 The alpha particle A of mass m and charge
q is repelled by the fixed nucleus B of charge Q and
moves on a hyperbolic orbit with the nucleus at the far
focus. The alpha particle is deflected through the angle $\theta$.


perpendicular distance from B is p. In the present treatment, we neglect the motion of the
gold nucleus. This is justified since the mass of the gold nucleus is about fifty times larger
than that of the alpha particle. A then moves in the electrostatic field due to B, which we
now suppose to be fixed at the origin O. The force exerted on A is then

\[ \mathbf{F} = \frac{qQ}{r^2}\,\hat{\mathbf{r}} \]

in cgs units. This is the repulsive inverse square field with $\gamma = qQ/m$.

We wish to find $\theta$, the angle through which the alpha particle is deflected. This is
obtained in exactly the same way as that of the asteroid in Example 7.1. From the initial
conditions, $L = pV$ and $E = \tfrac{1}{2}V^2$. The L-formula (7.31) and the E-formula (7.32) then
give

\[ p^2 V^2 = \frac{\gamma b^2}{a}, \qquad \tfrac{1}{2}V^2 = +\frac{\gamma}{2a}. \]

It follows that

\[ a = \frac{\gamma}{V^2}, \qquad b = p. \]

The semi-angle $\alpha$ between the asymptotes of the hyperbola is then given (see Appendix
A) by

\[ \tan\alpha = \frac{b}{a} = \frac{pV^2}{\gamma}. \]

Hence $\theta$, the angle through which the alpha particle is deflected, is given by

\[ \tan(\theta/2) = \tan(\pi/2 - \alpha) = \cot\alpha = \frac{\gamma}{pV^2}. \]
FIGURE 7.12 General scattering. A typical particle crosses the reference plane at the point p and
finally emerges in the direction of the unit vector $\mathbf{n}$. Particles that cross the reference plane within the
region $\mathcal{A}$ emerge within the (generalised) cone shown.

On writing $\gamma = qQ/m$, we obtain

\[ \tan(\theta/2) = \frac{qQ}{mpV^2} \tag{7.33} \]

as the formula for the deflection angle of the alpha particle. The quantity p, the distance 
by which the incident particle would miss the scatterer if there were no interaction, is 
called the impact parameter of the particle. 

The deflection formula (7.33) cannot be confirmed directly by experiment since this 
would require the observation of a single alpha particle, a single nucleus, and a knowledge 
of the impact parameter p. What is actually done is to irradiate a gold target by a uniform 
beam of alpha particles of the same energy. Thus the target consists of many gold nuclei 
together with their associated electrons. However, the electrons have masses that are very 
small compared to that of an alpha particle and so their influence can be disregarded. 
Also, the gold target is taken in the form of thin foil to minimise the chance of multiple 
collisions. If multiple collisions are eliminated, then the gold nuclei act as independent 
scatterers and the problem reduces to that of a single fixed gold nucleus irradiated by a 
uniform beam of alpha particles. In this problem the alpha particles come in with different 
values of the impact parameter p and are scattered through different angles in accordance 
with formula (7.33). What can be measured is the angular distribution of the scattered 
alpha particles. 

Differential scattering cross-section 

The angular distribution of scattered particles is expressed by a function $\sigma(\mathbf{n})$, called the
differential scattering cross section, where the unit vector $\mathbf{n}$ specifies the final direction
of emergence of a particle from the scatterer O. One may imagine the values of $\mathbf{n}$ corresponding
to points on the surface of a sphere with centre O and unit radius, as shown in
Figure 7.12. Then values of $\mathbf{n}$ that lie in the shaded patch $\mathcal{S}$ correspond to particles whose
final direction of emergence lies inside the (generalised) cone shown.
FIGURE 7.13 Axisymmetric scattering. Particles crossing the reference plane
within the shaded circular disk are scattered and emerge in directions within the 
circular cone. 

Take a reference plane far to the left of the scatterer and perpendicular to the incident
beam, as shown in Figure 7.12. Suppose that there is a uniform flux of incoming particles
crossing the reference plane such that $N$ particles cross any unit area of the reference
plane in unit time. When these particles have been scattered, they will emerge in different
directions and some of the particles will emerge with directions lying within the (generalised)
cone shown in Figure 7.12. The differential scattering cross section is defined
to be that function $\sigma(\mathbf{n})$ such that the flux of particles that emerge with directions lying
within the cone is given by the surface integral

\[ N\int_{\mathcal{S}} \sigma(\mathbf{n})\, dS. \tag{7.34} \]

It is helpful to regard $\sigma(\mathbf{n})$ as a scattering density, analogous to a probability density, that
must be integrated to give the flux of particles scattered within any given solid angle.

The particles that finally emerge within the cone must have crossed the reference plane
within some region $\mathcal{A}$ as shown in Figure 7.12. A typical particle crosses the reference
plane at the point p (relative to O') and eventually emerges in the direction $\mathbf{n}$ lying within
the cone. However, because the incoming beam is uniform, the flux of these particles
across $\mathcal{A}$ is just $N|\mathcal{A}|$, where $|\mathcal{A}|$ is the area of the region $\mathcal{A}$. On equating the incoming
and outgoing fluxes, we obtain the relation

\[ \int_{\mathcal{S}} \sigma(\mathbf{n})\, dS = |\mathcal{A}|. \tag{7.35} \]


This is the general relation that any differential scattering cross section must satisfy; it 
simply expresses the equality of incoming and outgoing fluxes of particles. However, 
Rutherford scattering is axisymmetric and this provides a major simplification. 

Axisymmetric scattering and Rutherford’s formula 

Rutherford scattering is simpler than the general case outlined above in that the problem
is axisymmetric about the axis O'O. Thus $\sigma$ depends on $\theta$ (the angle between $\mathbf{n}$ and the
axis O'O), but is independent of $\phi$ (the azimuthal angle measured around the axis). In
this case $\sigma(\theta)$ can be determined by using the axisymmetric regions shown in Figure 7.13.
Particles that cross the reference plane within the circle centre O' and radius $p_1$ emerge
within the circular cone $\theta_1 \le \theta \le \pi$, where $p_1$ and $\theta_1$ are related by the deflection formula
for a single particle, in our case formula (7.33). On applying equation (7.35) to the present
case, we obtain

\[ \int_{\mathcal{S}} \sigma(\theta)\, dS = \pi p_1^2. \]


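We evaluate the surface integral using $\theta$, $\phi$ coordinates. The element of surface area on
the unit sphere is given by $dS = \sin\theta\, d\theta\, d\phi$ so that

\[ \int_{\mathcal{S}} \sigma(\theta)\, dS = \int_{\theta_1}^{\pi}\left(\int_0^{2\pi} \sigma(\theta)\sin\theta\, d\phi\right) d\theta = 2\pi\int_{\theta_1}^{\pi} \sigma(\theta)\sin\theta\, d\theta. \]

Hence

\[ 2\pi\int_{\theta_1}^{\pi} \sigma(\theta)\sin\theta\, d\theta = \pi p_1^2 = 2\pi\int_0^{p_1} p\, dp = -2\pi\int_{\theta_1}^{\pi} p\,\frac{dp}{d\theta}\, d\theta, \]

on changing the integration variable from $p$ to $\theta$. Here the impact parameter $p$ is regarded
as a function of the scattering angle $\theta$; the minus sign arises because $p = 0$ corresponds to $\theta = \pi$ and $p = p_1$ to $\theta = \theta_1$. Now the above equality holds for all choices of the
integration limit $\theta_1$ and this can only be true if the two integrands are equal. Hence: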


Axisymmetric scattering cross section

\[ \sigma(\theta) = -\frac{p}{\sin\theta}\,\frac{dp}{d\theta} \tag{7.36} \]

This is the formula for the differential scattering cross section $\sigma$ in any problem of axisymmetric
scattering. All that is needed to evaluate it in any particular case is the expression
for the impact parameter $p$ in terms of the scattering angle $\theta$.

In the case of Rutherford scattering, the expression for $p$ in terms of $\theta$ is provided
by solving equation (7.33) for $p$, which gives

\[ p = \frac{qQ}{mV^2}\cot(\theta/2). \]

On substituting this function into the formula (7.36), we obtain


Rutherford’s scattering cross-section

\[ \sigma(\theta) = \frac{q^2 Q^2}{16E^2}\left(\frac{1}{\sin^4(\theta/2)}\right) \tag{7.37} \]

where $E\ (= \tfrac{1}{2}mV^2)$ is the energy of the incident alpha particles. This is Rutherford’s
formula for the angular distribution of the scattered alpha particles.
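
The substitution leading to (7.37) can be checked numerically. The sketch below evaluates the axisymmetric formula (7.36) with $p(\theta) = K\cot(\theta/2)$, using a central difference for $dp/d\theta$, and compares it with the closed form $K^2/[4\sin^4(\theta/2)]$, where $K = qQ/mV^2 = qQ/2E$. The symbol $K$, the function names and the step size are choices made for this illustration only.

```python
import math


def impact_parameter(theta, K):
    """p(theta) = K * cot(theta / 2), from solving (7.33) for p."""
    return K / math.tan(theta / 2.0)


def sigma_from_p(theta, K, h=1e-6):
    """Equation (7.36), sigma = -(p / sin(theta)) * dp/dtheta, by central difference."""
    dp_dtheta = (impact_parameter(theta + h, K) - impact_parameter(theta - h, K)) / (2.0 * h)
    return -impact_parameter(theta, K) / math.sin(theta) * dp_dtheta


def sigma_rutherford(theta, K):
    """Equation (7.37) written with K = qQ / (2E), so that K**2 / 4 = q^2 Q^2 / (16 E^2)."""
    return K ** 2 / (4.0 * math.sin(theta / 2.0) ** 4)


K = 1.0  # arbitrary length unit for the check
for theta in (0.3, 1.0, 2.0, 3.0):
    print(theta, sigma_from_p(theta, K), sigma_rutherford(theta, K))
```

The two columns should agree to the accuracy of the finite difference, and both grow rapidly as $\theta \to 0$, reflecting the large number of particles that pass far from the nucleus and are barely deflected.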

Significance of Rutherford’s experiment 

In the above description we have used the term ‘nucleus’ for convenience. What we
really mean is ‘the positively charged part of the atom that carries most of the mass’.
If this positive charge is distributed in a spherically symmetric manner, then the above
results still hold, irrespective of the radius of the charge, provided that the alpha particles
do not penetrate into the charge itself. What Rutherford found was that, when using
alpha particles from a radium source, the formula (7.37) held even for particles that were
scattered through angles close to $\pi$. These are the particles that get closest to the nucleus,
the distance of closest approach being $qQ/E$. This meant that the nuclear radius of gold
must be smaller than this distance, which was about $10^{-12}$ cm in Rutherford’s experiment.
The radius of an atom of gold is about $10^{-8}$ cm. This result completely contradicted the
Thomson model, in which the positive charge was distributed over the whole volume of
the atom, by showing that the nucleus (as it became known) must be a very small and very
dense core at the centre of the atom.

Note on two-body scattering problems 

Throughout this section we have neglected the motion of the target nucleus. This will introduce 
only small errors when the target nucleus is much heavier than the incident particles, as it was 
in Rutherford’s experiment. However, if lighter nuclei are used as the target, then the motion of
the nucleus cannot be neglected and we have a two-body scattering problem. Such problems 
are treated in Chapter 10. 


Appendix A The geometry of conics 


Ellipse 

(i) In Cartesian coordinates, the standard ellipse with semi-major axis a and semi-minor axis b (b < a)
has the equation

\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} = 1. \]
FIGURE 7.15 The standard hyperbola $x^2/a^2 - y^2/b^2 = 1$. The
near and far branches are relative to the focus $F'$, which is the
origin of polar coordinates.


(ii) The eccentricity e of the ellipse is defined by

\[ e^2 = 1 - \frac{b^2}{a^2} \]

and lies in the range $0 \le e < 1$. When e = 0, b = a and the ellipse is a circle.

(iii) The focal points F, F' of the ellipse lie on the major axis at $(\pm ae, 0)$.

(iv) In polar coordinates with origin at the focus F and with initial line in the positive x-direction, the
equation of the ellipse is

\[ \frac{1}{r} = \frac{a}{b^2}\,(1 + e\cos\theta). \]
FIGURE 7.16 The circular orbits A and B are the orbits of the two planets. The
elliptical orbit shown is a possible path for the spacecraft, which travels along
the arc LR. The velocities shown are those after the first firing at L and before
the second firing at R.


Hyperbola 

(i) In Cartesian coordinates, the standard hyperbola has the equation

\[ \frac{x^2}{a^2} - \frac{y^2}{b^2} = 1 \qquad (a, b > 0) \]

so that the angle $2\alpha$ between the asymptotes is given by

\[ \tan\alpha = \frac{b}{a}. \]

(ii) The eccentricity e of the hyperbola is defined by

\[ e^2 = 1 + \frac{b^2}{a^2} \]

and lies in the range e > 1.

(iii) The focal points F, F' of the hyperbola lie on the x-axis at $(\pm ae, 0)$.

(iv) In polar coordinates with origin at the focus F' and with initial line in the positive x-direction, the
equations of the near and far branches of the hyperbola are

\[ \frac{1}{r} = \frac{a}{b^2}\,(1 + e\cos\theta), \qquad \frac{1}{r} = \frac{a}{b^2}\,(-1 + e\cos\theta), \]

respectively.


Appendix B The Hohmann orbit is optimal 


The result that the Hohmann orbit is the connecting orbit that minimises Q is not at all 
obvious and correct proofs are rare.* Hopefully, the proof given below is correct!


* It is sometimes stated that the optimality requirement is to minimise the energy of the connecting orbit, 
which is not true. In any case, the Hohmann orbit is not the connecting orbit of minimum energy! 
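Proof of optimality Consider the general two-impulse transfer orbit LR shown in Figure 7.16, where
the orbit is regarded as being generated by the velocity components $v_\theta^L$, $v_r^L$ of the spacecraft after the first
impulse. Then, by angular momentum and energy conservation,

\[ A\,v_\theta^L = B\,v_\theta^R, \]

\[ (v_r^L)^2 + (v_\theta^L)^2 - \frac{2\gamma}{A} = (v_r^R)^2 + (v_\theta^R)^2 - \frac{2\gamma}{B}, \]

where A, B are the radii of the circular orbits of Earth and Jupiter and $\gamma = M_\odot G$. Thus

\[ v_\theta^R = \frac{A}{B}\,v_\theta^L, \qquad (v_r^R)^2 = (v_r^L)^2 + \left(1 - \frac{A^2}{B^2}\right)(v_\theta^L)^2 - 2\gamma\left(\frac{1}{A} - \frac{1}{B}\right). \]

Since the orbital speeds of Earth and Jupiter are $(\gamma/A)^{1/2}$ and $(\gamma/B)^{1/2}$, it follows that the velocity changes
$\Delta v^A$, $\Delta v^B$ required at L and R have magnitudes given by

\[ |\Delta v^A|^2 = \left(v_\theta^L - \left(\frac{\gamma}{A}\right)^{1/2}\right)^2 + (v_r^L)^2, \]

\[ |\Delta v^B|^2 = \left(\left(\frac{\gamma}{B}\right)^{1/2} - v_\theta^R\right)^2 + (v_r^R)^2
 = \left(\left(\frac{\gamma}{B}\right)^{1/2} - \frac{A}{B}\,v_\theta^L\right)^2 + (v_r^L)^2 + \left(1 - \frac{A^2}{B^2}\right)(v_\theta^L)^2 - 2\gamma\left(\frac{1}{A} - \frac{1}{B}\right). \]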

It is evident that, with $v_\theta^L$ fixed, both $|\Delta v^A|$ and $|\Delta v^B|$ are increasing functions of $v_r^L$. Thus Q may be
reduced by reducing $v_r^L$, provided that the resulting orbit still meets the circle r = B. Q can thus be reduced
until either

(i) $v_r^L$ is reduced to zero, or

(ii) the orbit shrinks until it touches the circle r = B and any further reduction in $v_r^L$ would mean that
the orbit would not meet r = B.

In the first case, L becomes the perihelion of the orbit and, in the second case, R becomes the aphelion of
the orbit. We will proceed assuming the first case, the second case being treated in a similar manner and
with the same result.

Suppose then that L is the perihelion of the connecting orbit. Then $v_r^L = 0$ and, from now on, we
will simply write $v$ instead of $v_\theta^L$. The velocity $v$ must be such that the orbit reaches the circle r = B,
which now means that the major axis of the orbit must not be less than A + B. On using the E-formula, this
implies that $v$ must satisfy

\[ v^2 \ge \frac{2\gamma B}{A(A + B)}. \]

The formulae for $|\Delta v^A|$ and $|\Delta v^B|$ now simplify to

\[ |\Delta v^A| = v - \left(\frac{\gamma}{A}\right)^{1/2}, \]

\[ |\Delta v^B|^2 = v^2 - 2\left(\frac{\gamma A^2}{B^3}\right)^{1/2} v + \gamma\left(\frac{3}{B} - \frac{2}{A}\right), \]

from which it is evident that, for $v$ in the permitted range, both of $|\Delta v^A|$ and $|\Delta v^B|$ are increasing functions
of $v$. Hence the minimum value of Q is achieved when $v$ takes its smallest permitted value, namely

\[ v = \left(\frac{2\gamma B}{A(A + B)}\right)^{1/2}. \]

With this value of $v$, the orbit touches the circle r = B and so has its aphelion at R. Hence the optimum
orbit has its perihelion at L and its aphelion at R. This is precisely the Hohmann orbit. ■
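
The result can also be checked by brute force. The sketch below is a numerical illustration, not part of the proof: it evaluates $Q$ for a grid of post-impulse velocity components $(v_\theta^L, v_r^L)$, using the conservation laws to obtain the components at R, and compares the smallest value found with the Hohmann value. It assumes $B > A$, a prograde transfer, and the Earth-Jupiter values $A = 1$, $B = 5.2$, $\gamma = 4\pi^2$; the function names and grid are invented for the illustration.

```python
import math


def total_delta_v(v_theta, v_r, A, B, gamma):
    """Q = |dv^A| + |dv^B| for a transfer from r = A out to r = B (B > A).

    (v_theta, v_r) are the transverse and radial components just after the
    first impulse. Returns math.inf when the orbit never reaches r = B.
    """
    v_theta_R = (A / B) * v_theta  # angular momentum conservation
    # Energy conservation gives the squared radial speed on arrival at r = B.
    v_r_R_sq = (v_r ** 2 + (1.0 - (A / B) ** 2) * v_theta ** 2
                - 2.0 * gamma * (1.0 / A - 1.0 / B))
    if v_r_R_sq < 0.0:
        return math.inf
    dv_A = math.hypot(v_theta - math.sqrt(gamma / A), v_r)
    dv_B = math.hypot(math.sqrt(gamma / B) - v_theta_R, math.sqrt(v_r_R_sq))
    return dv_A + dv_B


def hohmann_delta_v(A, B, gamma):
    """Q for the Hohmann orbit: perihelion at L, aphelion at R."""
    v = math.sqrt(2.0 * gamma * B / (A * (A + B)))
    return (v - math.sqrt(gamma / A)) + (math.sqrt(gamma / B) - (A / B) * v)


A, B, gamma = 1.0, 5.2, 4.0 * math.pi ** 2
step = 0.02  # grid spacing in AU per year
best = min(
    (total_delta_v(6.0 + i * step, j * step, A, B, gamma), 6.0 + i * step, j * step)
    for i in range(201)
    for j in range(201)
)
print("grid minimum (Q, v_theta, v_r):", best)
print("Hohmann Q:", hohmann_delta_v(A, B, gamma))
```

No grid point should give a smaller $Q$ than the Hohmann value, and the best grid point should lie close to $v_r^L = 0$ with $v_\theta^L$ just above the Hohmann lift-off speed.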


Problems on Chapter 7 

Answers and comments are at the end of the book. 

Harder problems carry a star (*). 

Radial motion equation, apses 

7.1 A particle P of mass $m$ moves under the repulsive inverse cube field $\mathbf{F} = (m\gamma/r^3)\,\hat{\mathbf{r}}$.
Initially P is at a great distance from O and is moving with speed V towards O along a
straight line whose perpendicular distance from O is p. Find the equation satisfied by the
apsidal distances. What is the distance of closest approach of P to O?

7.2 A particle P of mass $m$ moves under the attractive inverse square field $\mathbf{F} = -(m\gamma/r^2)\,\hat{\mathbf{r}}$.
Initially P is at a point C, a distance c from O, when it is projected with speed $(\gamma/c)^{1/2}$ in a
direction making an acute angle $\alpha$ with the line OC. Find the apsidal distances in the resulting
orbit.

Given that the orbit is an ellipse with O at a focus, find the semi-major and semi-minor
axes of this ellipse.

7.3 A particle of mass $m$ moves under the attractive inverse square field $\mathbf{F} = -(m\gamma/r^2)\,\hat{\mathbf{r}}$.
Show that the equation satisfied by the apsidal distances is

\[ 2Er^2 + 2\gamma r - L^2 = 0, \]

where E and L are the specific total energy and angular momentum of the particle. When
E < 0, the orbit is known to be an ellipse with O as a focus. By considering the sum and
product of the roots of the above equation, establish the elliptic orbit formulae

\[ L^2 = \gamma b^2/a, \qquad E = -\gamma/2a. \]

7.4 A particle P of mass $m$ moves under the simple harmonic field $\mathbf{F} = -(m\Omega^2 r)\,\hat{\mathbf{r}}$, where
$\Omega$ is a positive constant. Obtain the radial motion equation and show that all orbits of P are
bounded.

Initially P is at a point C, a distance c from O, when it is projected with speed $\Omega c$ in
a direction making an acute angle $\alpha$ with OC. Find the equation satisfied by the apsidal
distances. Given that the orbit of P is an ellipse with centre O, find the semi-major and semi-minor
axes of this ellipse.
Path equation

7.5 A particle P moves under the attractive inverse square field $\mathbf{F} = -(m\gamma/r^2)\,\hat{\mathbf{r}}$. Initially
P is at the point C, a distance c from O, and is projected with speed $(3\gamma/c)^{1/2}$ perpendicular
to OC. Find the polar equation of the path and make a sketch of it. Deduce the angle between OC
and the final direction of departure of P.


7.6 A comet moves under the gravitational attraction of the Sun. Initially the comet is at a 
great distance from the Sun and is moving towards it with speed V along a straight line whose 
perpendicular distance from the Sun is p. By using the path equation, find the angle through 
which the comet is deflected and the distance of closest approach. 


7.7 A particle P of mass $m$ moves under the attractive inverse cube field $\mathbf{F} = -(m\gamma^2/r^3)\,\hat{\mathbf{r}}$,
where $\gamma$ is a positive constant. Initially P is at a great distance from O and is projected
towards O with speed V along a line whose perpendicular distance from O is p. Obtain the
path equation for P.

For the case in which

\[ V = \frac{15\gamma}{\sqrt{209}\,p}, \]

find the polar equation of the path of P and make a sketch of it. Deduce the distance of closest
approach to O, and the final direction of departure.


7.8* A particle P of mass $m$ moves under the central field $\mathbf{F} = -(m\gamma^2/r^5)\,\hat{\mathbf{r}}$, where $\gamma$ is
a positive constant. Initially P is at a great distance from O and is projected towards O with
speed $\sqrt{2}\,\gamma/p^2$ along a line whose perpendicular distance from O is p. Show that the polar
equation of the path of P is given by

\[ r = \frac{p}{\sqrt{2}}\coth\left(\frac{\theta}{\sqrt{2}}\right). \]

Make a sketch of the path.

7.9* A particle of mass $m$ moves under the central field



where y and a are positive constants. Initially the particle is at a distance a from the centre of 
force and is projected at right angles to the radius vector with speed $3\gamma/(\sqrt{2}\,a)$. Find the polar
equation of the resulting path and make a sketch of it. 

Find the time taken for the particle to reach the centre of force. 


Nearly circular orbits 

7.10 A particle of mass $m$ moves under the central field

\[ \mathbf{F} = -m\left(\frac{\gamma e^{-\epsilon r/a}}{r^2}\right)\hat{\mathbf{r}}, \]

where $\gamma$, $a$ and $\epsilon$ are positive constants. Find the apsidal angle for a nearly circular orbit of
radius $a$. When $\epsilon$ is small, show that the perihelion of the orbit advances by approximately $\pi\epsilon$
on each revolution.

7.11 Solar oblateness A planet of mass m moves in the equatorial plane of a star that is a 
uniform oblate spheroid. The planet experiences a force field of the form 



approximately, where $\gamma$, $a$ and $\epsilon$ are positive constants and $\epsilon$ is small. If the planet moves
in a nearly circular orbit of radius $a$, find an approximation to the ‘annual’ advance of the
perihelion. [It has been suggested that oblateness of the Sun might contribute significantly to
the precession of the planets, thus undermining the success of general relativity. This point has
yet to be resolved conclusively.]

7.12 Suppose the solar system is embedded in a dust cloud of uniform density $\rho$. Find an
approximation to the ‘annual’ advance of the perihelion of a planet moving in a nearly circular
orbit of radius $a$. (For convenience, let $\rho = \epsilon M/a^3$, where M is the solar mass and $\epsilon$ is
small.)

7.13 Orbits in general relativity In the theory of general relativity, the path equation for a 
planet moving in the gravitational field of the Sun is, in the standard notation. 



where c is the speed of light. Find an approximation to the ‘annual’ advance of the perihelion 
of a planet moving in a nearly circular orbit of radius a. 

Scattering 

7.14 A uniform flux of particles is incident upon a fixed hard sphere of radius a. The particles 
that strike the sphere are reflected elastically. Find the differential scattering cross section. 

7.15 A uniform flux of particles, each of mass $m$ and speed V, is incident upon a fixed scatterer
that exerts the repulsive radial force $\mathbf{F} = (m\gamma^2/r^3)\,\hat{\mathbf{r}}$. Find the impact parameter p as
a function of the scattering angle $\theta$, and deduce the differential scattering cross section. Find
the total back-scattering cross-section.

Assorted inverse square problems 

Some useful data: 

The radius R of the Earth is 6380 km. To obtain the value of MG, where M is the mass of the
Earth, use the formula $MG = R^2 g$, where $g = 9.80$ m s$^{-2}$.

1 AU per year is 4.74 km per second. In astronomical units, $G = 4\pi^2$.

7.16 In Yuri Gagarin’s first manned space flight in 1961, the perigee and apogee were 181
km and 327 km above the Earth. Find the period of his orbit and his maximum speed in the 
orbit. 

7.17 An Earth satellite has a speed of 8.60 km per second at its perigee 200 km above the 
Earth’s surface. Find the apogee distance above the Earth, its speed at the apogee, and the 
period of its orbit. 

7.18 A spacecraft is orbiting the Earth in a circular orbit of radius c when the motors are 
fired so as to multiply the speed of the spacecraft by a factor k (k > 1), its direction of 
motion being unaffected. [You may neglect the time taken for this operation.] Find the range
of k for which the spacecraft will escape from the Earth, and the eccentricity of the escape 
orbit. 

7.19 A spacecraft travelling with speed V approaches a planet of mass M along a straight 
line whose perpendicular distance from the centre of the planet is p. When the spacecraft 
is at a distance c from the planet, it fires its engines so as to multiply its current speed by a 
factor k (0 < k < 1), its direction of motion being unaffected. [You may neglect the time 
taken for this operation.] Find the condition that the spacecraft should go into orbit around the 
planet. 

7.20 A body moving in an inverse square attractive field traverses an elliptical orbit with major 
axis 2a. Show that the time average of the potential energy $V = -\gamma/r$ is $-\gamma/a$. [Transform
the time integral to an integral with respect to the eccentric angle $\psi$.]

Deduce the time average of the kinetic energy in the same orbit. 

7.21 A body moving in an inverse square attractive field traverses an elliptical orbit with 
eccentricity e and major axis 2a. Show that the time average of the distance r of the body 
from the centre of force is $a(1 + \tfrac{1}{2}e^2)$. [Transform the time integral to an integral with respect
to the eccentric angle $\psi$.]

7.22 A spacecraft is ‘parked’ in a circular orbit 200 km above the Earth’s surface. The spacecraft
is to be sent to the Moon’s orbit by Hohmann transfer. Find the velocity changes $\Delta v^E$
and $\Delta v^M$ that are required at the Earth and Moon respectively. How long does the journey
take? [The radius of the Moon’s orbit is 384,000 km. Neglect the gravitation of the
Moon.]

7.23* A spacecraft is ‘parked’ in an elliptic orbit around the Earth. What is the most fuel 
efficient method of escaping from the Earth by using a single impulse? 

7.24 A satellite already in the Earth’s heliocentric orbit can fire its engines only once. What 
is the most fuel efficient method of sending the satellite on a ‘flyby’ visit to another planet? 
The satellite can visit either Mars or Venus. Which trip would use less fuel? Which trip
would take the shorter time? [The orbits of Mars and Venus have radii 1.524 AU and 0.723 
AU respectively.] 
7.25 A satellite is ‘parked’ in a circular orbit 250 km above the Earth’s surface. What is the
most fuel efficient method of transferring the satellite to an (elliptical) synchronous orbit by 
using a single impulse? [A synchronous orbit has a period of 23 hr 56 m.] Find the value of 
Av and apogee distance. 


Effect of resistance 

7.26 A satellite of mass $m$ moves under the attractive inverse square field $-(m\gamma/r^2)\,\hat{\mathbf{r}}$ and is
also subject to the linear resistance force $-mK\mathbf{v}$, where K is a positive constant. Show that
the governing equations of motion can be reduced to the form

\[ \ddot{r} + K\dot{r} + \frac{\gamma}{r^2} - \frac{L_0^2 e^{-2Kt}}{r^3} = 0, \qquad r^2\dot{\theta} = L_0 e^{-Kt}, \]

where $L_0$ is a constant which will be assumed to be positive.

Suppose now that the effect of resistance is slight and that the satellite is executing a ‘circular’
orbit of slowly changing radius. By neglecting the terms in $\dot{r}$ and $\ddot{r}$, find an approximate
solution for the time variation of $r$ and $\theta$ in such an orbit. Deduce that small resistance causes
the circular orbit to contract slowly, but that the satellite speeds up!

7.27 Repeat the last problem for the case in which the particle moves under the simple harmonic
attractive field $-(m\Omega^2 r)\,\hat{\mathbf{r}}$ with the same law of resistance. Show that, in this case,
the body slows down as the orbit contracts. [This problem can be solved exactly in Cartesian 
coordinates, but do not do it this way.] 


Computer assisted problems 

7.28 See the advance of the perihelion of Mercury It is possible to ‘see’ the advance of
the perihelion of Mercury predicted by general relativity by direct numerical solution. Take
Einstein’s path equation (see Problem 7.13) in the dimensionless form

\[ \frac{d^2 v}{d\theta^2} + v = \frac{1}{1 - e^2} + \eta v^2, \]

where $v = au$. Here $a$ and $e$ are the semi-major axis and eccentricity of the non-relativistic
elliptic orbit and $\eta = 3MG/ac^2$ is a small dimensionless parameter. For the orbit of Mercury,
$\eta = 2.3 \times 10^{-7}$ approximately.

Solve this equation numerically with the initial conditions $r = a(1 + e)$ and $\dot{r} = 0$ when
$\theta = 0$; this makes $\theta = 0$ an aphelion of the orbit. To make the precession easy to see, use
a fairly eccentric ellipse and take $\eta$ to be about 0.005, which speeds up the precession by a
factor of more than $10^4$!

7.29 Orbit with linear resistance Confirm the approximate solution for small resistance
obtained in Problem 7.26 by numerical solution of the governing simultaneous ODEs. First
write the governing equations in dimensionless form. Suppose that, in the absence of
resistance, a circular orbit with $r = a$ and $\dot{\theta} = \Omega$ is possible; then $\gamma = a^3\Omega^2$ and $L_0 = a^2\Omega$.
On taking dimensionless variables $\rho$, $\tau$ defined by $\rho = r/a$ and $\tau = \Omega t$, and taking $L_0 = a^2\Omega$,
the governing equations become

$$\frac{d^2\rho}{d\tau^2} + \epsilon\frac{d\rho}{d\tau} + \frac{1}{\rho^2} - \frac{e^{-2\epsilon\tau}}{\rho^3} = 0, \qquad \rho^2\frac{d\theta}{d\tau} = e^{-\epsilon\tau},$$

where $\epsilon = K/\Omega$ is the dimensionless resistance parameter. Solve these equations with the
initial conditions $\rho = 1$, $d\rho/d\tau = 0$ and $\theta = 0$ when $\tau = 0$. Choose some small value for $\epsilon$
and plot a polar graph of the path.

Chapter Eight




Non-linear oscillations 
and phase space 


KEY FEATURES 

The key features of this chapter are the use of perturbation theory to solve weakly non-linear
problems, the notion of phase space, the Poincaré-Bendixson theorem, and limit cycles.


In reality, most oscillating mechanical systems are governed by non-linear equations. 
The linear oscillation theory developed in Chapter 5 is generally an approximation which 
is accurate only when the amplitude of the oscillations is small. Unfortunately, non-linear 
oscillation equations do not have nice exact solutions as their linear counterparts do, and 
this makes the non-linear theory difficult to investigate analytically. 

In this chapter we describe two different analytical approaches, each of which is successful
in its own way. The first is to use perturbation theory to find successive corrections
to the linear theory. This gives a more accurate solution than the linear theory
when the non-linear terms in the equation are small. However, because the solution is
close to that predicted by the linear theory, new phenomena associated with non-linearity
are unlikely to be discovered by perturbation theory! The second approach involves the
use of geometrical arguments in phase space. This has the advantage that the non-linear
effects can be large, but the conclusions are likely to be qualitative rather than quantitative.
A particular triumph of this approach is the Poincaré-Bendixson theorem, which
can be used to prove the existence of limit cycles, a new phenomenon that exists only in
the non-linear theory.

8.1 PERIODIC NON-LINEAR OSCILLATIONS 

Most oscillating mechanical systems are not exactly linear but are approximately
linear when the oscillation amplitude is small. In the case of a body on a spring, the
restoring force might actually have the form

$$S = m\Omega^2 x + m\Lambda x^3, \tag{8.1}$$

which is approximated by the linear formula $S = m\Omega^2 x$ when the displacement $x$ is small.
The new constant $\Lambda$ is a measure of the strength of the non-linear effect. If $\Lambda < 0$, then
$S$ is less than its linear approximation and the spring is said to be softening as $x$ increases.
Conversely, if $\Lambda > 0$, then the spring is hardening as $x$ increases. The formula (8.1) is
typical of non-linear restoring forces that are symmetrical about $x = 0$. If the restoring
force is unsymmetrical about $x = 0$, the leading correction to the linear case will be a
term in $x^2$.

FIGURE 8.1 Existence of periodic oscillations for the quartic
potential energy $V = \tfrac12 m\Omega^2 x^2 + \tfrac14 m\Lambda x^4$ with $\Lambda < 0$.


Existence of non-linear periodic oscillations 

Consider the free undamped oscillations of a body sliding on a smooth horizontal table*
and connected to a fixed point of the table by a spring whose restoring force is given by
the cubic formula (8.1). In rectilinear motion, the governing equation is then

$$\frac{d^2x}{dt^2} + \Omega^2 x + \Lambda x^3 = 0, \tag{8.2}$$

which is Duffing’s equation with no forcing term (see section 8.5). The existence of
periodic oscillations can be proved by the energy method described in Chapter 6. The
restoring force has potential energy

$$V = \tfrac12 m\Omega^2 x^2 + \tfrac14 m\Lambda x^4,$$

so that the energy conservation equation is

$$\tfrac12 mv^2 + \tfrac12 m\Omega^2 x^2 + \tfrac14 m\Lambda x^4 = E,$$

where $v = \dot{x}$. The motion is therefore restricted to those values of $x$ that satisfy

$$\tfrac12 m\Omega^2 x^2 + \tfrac14 m\Lambda x^4 \le E,$$

with equality when $v = 0$. Figure 8.1 shows a sketch of $V$ for a softening spring ($\Lambda < 0$).
For each value of $E$ in the range $0 < E < V_{\max}$, the particle oscillates in a symmetrical
range $-a \le x \le a$ as shown. Thus oscillations of any amplitude less than $a_{\max}$ $(= \Omega/|\Lambda|^{1/2})$
are possible. For a hardening spring, oscillations of any amplitude whatsoever
are possible.

* Would the motion be the same (relative to the equilibrium position) if the body were suspended vertically
by the same spring?


Solution by perturbation theory 

Suppose then that the body is performing periodic oscillations with amplitude $a$. In order
to reduce the number of parameters, we non-dimensionalise equation (8.2). Let the dimensionless
displacement $X$ be defined by $x = aX$. Then $X$ satisfies the equation

$$\frac{1}{\Omega^2}\frac{d^2X}{dt^2} + X + \epsilon X^3 = 0, \tag{8.3}$$

with the initial conditions $X = 1$ and $dX/dt = 0$ when $t = 0$. The dimensionless
parameter $\epsilon$, defined by

$$\epsilon = \frac{a^2\Lambda}{\Omega^2}, \tag{8.4}$$

is a measure of the strength of the non-linearity. Equation (8.3) contains $\epsilon$ as a parameter
and hence so does the solution. A major feature of interest is how the period $\tau$ of the
motion varies with $\epsilon$.
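The substitution that leads from (8.2) to (8.3) and (8.4) can be written out in one line. The following is only a sketch of that algebra and uses no symbols beyond those already defined in the text.

```latex
% Substitute x = aX into (8.2), then divide through by a\Omega^2:
\begin{align*}
a\,\frac{d^2X}{dt^2} + \Omega^2 a X + \Lambda a^3 X^3 = 0
\quad\Longrightarrow\quad
\frac{1}{\Omega^2}\frac{d^2X}{dt^2} + X + \frac{a^2\Lambda}{\Omega^2}\,X^3 = 0 .
\end{align*}
% The coefficient of X^3 is the parameter \epsilon of (8.4).
% Starting at the amplitude, x = a and dx/dt = 0 at t = 0, gives X = 1 and dX/dt = 0.
```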

The non-linear equation of motion (8.3) cannot be solved explicitly but it reduces
to a simple linear equation when the parameter $\epsilon$ is zero. In these circumstances, one
can often find an approximate solution to the non-linear equation valid when $\epsilon$ is small.
Equations in which the non-linear terms are small are said to be weakly non-linear and
the solution technique is called perturbation theory. There is a well established theory
of such perturbations. The simplest case is as follows:


Regular perturbation expansion

If the parameter $\epsilon$ appears as the coefficient of any term of an ODE that is not the
highest derivative in that equation, then, when $\epsilon$ is small, the solution corresponding
to fixed initial conditions can be expanded as a power series in $\epsilon$.


This is called a regular perturbation expansion* and it applies to the equation (8.3).
It follows that the solution $X(t, \epsilon)$ can be expanded in the regular perturbation series

$$X(t, \epsilon) = X_0(t) + \epsilon X_1(t) + \epsilon^2 X_2(t) + \cdots . \tag{8.5}$$


* The case in which the small parameter multiplies the highest derivative in the equation is called a singular
perturbation. For experts only!

The standard method is to substitute this series into the equation (8.3) and then to try to
determine the functions $X_0(t)$, $X_1(t)$, $X_2(t)$, ... . In the present case however, this leads
to an unsatisfactory result because the functions $X_1(t)$, $X_2(t)$, ... , turn out to be non-periodic
(and unbounded) even though the exact solution $X(t, \epsilon)$ is periodic!* Also, it is
not clear how to find approximations to $\tau$ from such a series.
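The unbounded growth of the direct expansion can be seen numerically. The sketch below is an illustration under stated assumptions, not part of the original text: time is scaled so that $\Omega = 1$, the function names are hypothetical, and the formula used for $X_1(t)$ is the solution of $X_1'' + X_1 = -\cos^3 t$ with $X_1 = X_1' = 0$ at $t = 0$, which contains the non-periodic term $-\tfrac38 t \sin t$. The two-term series is compared with a fourth order Runge-Kutta solution of (8.3).

```python
import math

def rk4_duffing(eps, t_end, n_steps):
    """Integrate X'' + X + eps*X**3 = 0 (time scaled so that Omega = 1)
    from X = 1, X' = 0 by classical RK4 and return X(t_end)."""
    def rhs(x, v):
        return v, -x - eps * x**3

    h = t_end / n_steps
    x, v = 1.0, 0.0
    for _ in range(n_steps):
        k1x, k1v = rhs(x, v)
        k2x, k2v = rhs(x + 0.5 * h * k1x, v + 0.5 * h * k1v)
        k3x, k3v = rhs(x + 0.5 * h * k2x, v + 0.5 * h * k2v)
        k4x, k4v = rhs(x + h * k3x, v + h * k3v)
        x += h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
    return x

def naive_two_term(eps, t):
    """X0(t) + eps*X1(t) from the direct expansion in t (Omega = 1).
    X1 solves X1'' + X1 = -cos(t)**3 with X1(0) = X1'(0) = 0."""
    x0 = math.cos(t)
    x1 = -0.375 * t * math.sin(t) + (math.cos(3 * t) - math.cos(t)) / 32
    return x0 + eps * x1

if __name__ == "__main__":
    eps = 0.1
    for t in (1.0, 10.0, 40.5 * math.pi):
        # columns: time, numerical solution, two-term direct expansion
        print(t, rk4_duffing(eps, t, 40000), naive_two_term(eps, t))
```

For short times the two columns agree closely, but at large times the series value leaves the interval $[-1, 1]$ in which the true solution must stay.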

This difficulty can be overcome by replacing $t$ by a new variable $s$ so that the solution
$X(s, \epsilon)$ has period $2\pi$ in $s$ whatever the value of $\epsilon$. Every term of the perturbation series
will then also be periodic with period $2\pi$. This trick is known as Lindstedt’s method.

Lindstedt’s method

Let $\omega(\epsilon)$ $(= 2\pi/\tau(\epsilon))$ be the angular frequency of the required solution of equation (8.3).
Now introduce a new independent variable $s$ (the dimensionless time) by the equation
$s = \omega(\epsilon)t$. Then $X(s, \epsilon)$ satisfies the equation

$$\left(\frac{\omega(\epsilon)}{\Omega}\right)^2 X'' + X + \epsilon X^3 = 0 \tag{8.6}$$

with the initial conditions $X = 1$ and $X' = 0$ when $s = 0$. (Here $'$ means $d/ds$.) We now
seek a solution of this equation in the form of the perturbation series

$$X(s, \epsilon) = X_0(s) + \epsilon X_1(s) + \epsilon^2 X_2(s) + \cdots , \tag{8.7}$$

which is possible when $\epsilon$ is small. By construction, this solution must have period $2\pi$ for
all $\epsilon$, from which it follows that each of the functions $X_0(s)$, $X_1(s)$, $X_2(s)$, ... must also
have period $2\pi$. However we have paid a price for this simplification since the unknown
angular frequency $\omega(\epsilon)$ now appears in the equation (8.6); indeed, the function $\omega(\epsilon)$ is
part of the answer to this problem! We must therefore also expand $\omega(\epsilon)$ as a perturbation
series in $\epsilon$. From equation (8.3), it follows that $\omega(0) = \Omega$ so we may write

$$\frac{\omega(\epsilon)}{\Omega} = 1 + \omega_1\epsilon + \omega_2\epsilon^2 + \cdots , \tag{8.8}$$

where $\omega_1$, $\omega_2$, ... are unknown constants that must be determined along with the functions
$X_0(s)$, $X_1(s)$, $X_2(s)$, ... .

On substituting the expansions (8.7) and (8.8) into the governing equation (8.6) and
its initial conditions, we obtain:

$$(1 + \omega_1\epsilon + \omega_2\epsilon^2 + \cdots)^2 (X_0'' + \epsilon X_1'' + \epsilon^2 X_2'' + \cdots) + (X_0 + \epsilon X_1 + \epsilon^2 X_2 + \cdots) + \epsilon (X_0 + \epsilon X_1 + \epsilon^2 X_2 + \cdots)^3 = 0,$$

with

$$X_0 + \epsilon X_1 + \epsilon^2 X_2 + \cdots = 1, \qquad X_0' + \epsilon X_1' + \epsilon^2 X_2' + \cdots = 0,$$

when $s = 0$. If we now equate coefficients of powers of $\epsilon$ in these equalities, we obtain a
succession of ODEs and initial conditions, the first two of which are as follows:

From coefficients of $\epsilon^0$, we obtain the zero order equation

$$X_0'' + X_0 = 0, \tag{8.9}$$

with $X_0 = 1$ and $X_0' = 0$ when $s = 0$.

From coefficients of $\epsilon^1$, we obtain the first order equation

$$X_1'' + X_1 = -2\omega_1 X_0'' - X_0^3, \tag{8.10}$$

with $X_1 = 0$ and $X_1' = 0$ when $s = 0$.

* This ‘paradox’ causes great bafflement when first encountered, but it is inevitable when the period $\tau$ of
the motion depends on $\epsilon$, as it does in this case. To have a series of non-periodic terms is not wrong,
as is sometimes stated. However, it is certainly unsatisfactory to have a non-periodic approximation to a
periodic function.

This procedure can be extended to any number of terms but the equations rapidly become
very complicated. The method now is to solve these equations in order; the only sticking
point is how to determine the unknown constants $\omega_1$, $\omega_2$, ... that appear on the right sides
of the equations. The solution of the zero order equation and initial conditions is

$$X_0 = \cos s \tag{8.11}$$

and this can now be substituted into the first order equation (8.10) to give

$$X_1'' + X_1 = 2\omega_1 \cos s - \cos^3 s = \tfrac14 (8\omega_1 - 3)\cos s - \tfrac14 \cos 3s, \tag{8.12}$$

on using the trigonometric identity $\cos 3s = 4\cos^3 s - 3\cos s$. This equation can now
be solved by standard methods. The particular integral corresponding to the $\cos 3s$ on the
right is $-(1/8)\cos 3s$, but the particular integral corresponding to the $\cos s$ on the right is
$(1/2)\,s \sin s$, since $\cos s$ is a solution of the equation $X'' + X = 0$. The general solution of
the first order equation is therefore

$$X_1 = \left(\omega_1 - \tfrac38\right) s \sin s + \tfrac{1}{32}\cos 3s + A\cos s + B\sin s,$$

where $A$ and $B$ are arbitrary constants. Observe that the functions $\cos s$, $\sin s$ and $\cos 3s$
are all periodic with period $2\pi$, but the term $s \sin s$ is not periodic. Thus, the coefficient
of $s \sin s$ must be zero, for otherwise $X_1(s)$ would not be periodic, which we know it must
be. Hence

$$\omega_1 = \tfrac38, \tag{8.13}$$

which determines the first unknown coefficient in the expansion (8.8) of $\omega(\epsilon)$. The solution
of the first order equation and initial conditions is then

$$X_1 = -\tfrac{1}{32}(\cos s - \cos 3s).$$

We have thus shown that, when $\epsilon$ is small,

$$\frac{\omega}{\Omega} = 1 + \tfrac38 \epsilon + O(\epsilon^2) \tag{8.14}$$

and

$$X = \cos s - \frac{\epsilon}{32}(\cos s - \cos 3s) + O(\epsilon^2),$$

where $s = \left(1 + \tfrac38 \epsilon + O(\epsilon^2)\right)\Omega t$.

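Results

When $\epsilon$ $(= a^2\Lambda/\Omega^2)$ is small, the period $\tau$ of the oscillation of equation (8.2) with
amplitude $a$ is given by

$$\tau = \frac{2\pi}{\omega} = \frac{2\pi}{\Omega}\left(1 + \tfrac38 \epsilon + O(\epsilon^2)\right)^{-1} = \frac{2\pi}{\Omega}\left(1 - \tfrac38 \epsilon + O(\epsilon^2)\right) \tag{8.15}$$

and the corresponding displacement $x(t)$ is given by

$$x = a\left[\cos s - \frac{\epsilon}{32}(\cos s - \cos 3s) + O(\epsilon^2)\right], \tag{8.16}$$

where $s = \left(1 + \tfrac38 \epsilon + O(\epsilon^2)\right)\Omega t$.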

This is the approximate solution correct to the first order in the small parameter $\epsilon$.
More terms can be obtained in a similar way but the effort needed increases exponentially
and this is best done with computer assistance (see Problem 8.15).

These formulae apply only when $\epsilon$ is small, that is, when the non-linearity in the
equation has a small effect. Thus we have laboured through a sizeable chunk of mathematics
to produce an answer that is only slightly different from the linear case. This sad
fact is true of all regular perturbation problems. However, in non-linear mechanics, one
must be thankful for even modest successes.
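The period formula (8.15) can be checked against a direct numerical solution of equation (8.3). The sketch below is illustrative only: the function names and the step size are assumptions, and the period is estimated as four times the time at which $X$ first reaches zero, which relies on the symmetry of the oscillation and assumes $\epsilon > -1$ so that the motion with unit amplitude is periodic.

```python
import math

def duffing_period(eps, omega=1.0, h=1e-4):
    """Estimate the period of X''/omega**2 + X + eps*X**3 = 0 started from
    X = 1, X' = 0. RK4 is advanced until X changes sign; the zero is located
    by linear interpolation and multiplied by four (quarter-period symmetry)."""
    def rhs(x, v):
        return v, -omega**2 * (x + eps * x**3)

    t, x, v = 0.0, 1.0, 0.0
    while True:
        k1x, k1v = rhs(x, v)
        k2x, k2v = rhs(x + 0.5 * h * k1x, v + 0.5 * h * k1v)
        k3x, k3v = rhs(x + 0.5 * h * k2x, v + 0.5 * h * k2v)
        k4x, k4v = rhs(x + h * k3x, v + h * k3v)
        x_new = x + h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6
        v_new = v + h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
        if x_new <= 0.0:
            t_zero = t + h * x / (x - x_new)  # linear interpolation to X = 0
            return 4.0 * t_zero
        t, x, v = t + h, x_new, v_new

def lindstedt_period(eps, omega=1.0):
    """First order Lindstedt result (8.15): (2*pi/omega)*(1 - 3*eps/8)."""
    return (2.0 * math.pi / omega) * (1.0 - 3.0 * eps / 8.0)

if __name__ == "__main__":
    for eps in (0.0, 0.05, 0.1):
        # columns: eps, numerical period, first order perturbation estimate
        print(eps, duffing_period(eps), lindstedt_period(eps))
```

Halving $\epsilon$ should reduce the discrepancy between the two columns by roughly a factor of four, as expected of an error of order $\epsilon^2$.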


8.2 THE PHASE PLANE (($x_1$, $x_2$)-plane)

The second approach that we will describe could not be more different from perturbation
theory. It makes use of qualitative geometrical arguments in the phase space of
the system.

Systems of first order ODEs

The notion of phase space springs from the theory of systems of first order ODEs.
Such systems are very common and need have no connection with classical mechanics. A
standard example is the predator-prey system of equations

$$\dot{x}_1 = a x_1 - b x_1 x_2, \qquad \dot{x}_2 = b x_1 x_2 - c x_2,$$

which govern the population density $x_1(t)$ of a prey and the population density $x_2(t)$ of its
predator. In the general case there are $n$ unknown functions satisfying $n$ first order ODEs,
but here we will only make use of two unknown functions $x_1(t)$, $x_2(t)$ that satisfy a pair
of first order ODEs of the form

$$\dot{x}_1 = F_1(x_1, x_2, t), \qquad \dot{x}_2 = F_2(x_1, x_2, t). \tag{8.17}$$

Just to confuse matters, a system of ODEs like (8.17) is called a dynamical system,
whether it has any connection with classical mechanics or not! In the predator-prey
dynamical system, the function $F_1 = a x_1 - b x_1 x_2$ and the function $F_2 = b x_1 x_2 - c x_2$.
In this case $F_1$ and $F_2$ have no explicit time dependence. Such systems are said to be
autonomous; as we shall see, more can be said about the behaviour of autonomous systems.

Definition 8.1 Autonomous system A system of equations of the form

$$\dot{x}_1 = F_1(x_1, x_2), \qquad \dot{x}_2 = F_2(x_1, x_2), \tag{8.18}$$

is said to be autonomous.

The phase plane 

The values of the variables $x_1$, $x_2$ at any instant can be represented by a point in the
($x_1$, $x_2$)-plane. This plane is called the phase plane* of the system. A solution of the
system of equations (8.17) is then represented by a point moving in the phase plane. The
path traced out by such a point is called a phase path† of the system and the set of all
phase paths is called the phase diagram. In the predator-prey problem, the variables $x_1$,
$x_2$ are positive quantities and so the physically relevant phase paths lie in the first quadrant
of the phase plane. It can be shown that they are all closed curves! (See Problem 8.10).
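A numerical experiment makes the closed-curve claim plausible, although it is no substitute for the proof asked for in Problem 8.10. The sketch below rests on assumptions that are not in the text: the parameter values $a = b = c = 1$, the starting point, the function names, and the quantity $H = b(x_1 + x_2) - c \ln x_1 - a \ln x_2$, which direct differentiation shows to be constant along solutions of the predator-prey equations as written above.

```python
import math

def rk4_step(f, state, h):
    """One classical RK4 step for the autonomous system state' = f(state)."""
    k1 = f(state)
    k2 = f([s + 0.5 * h * k for s, k in zip(state, k1)])
    k3 = f([s + 0.5 * h * k for s, k in zip(state, k2)])
    k4 = f([s + h * k for s, k in zip(state, k3)])
    return [s + h * (p + 2 * q + 2 * r + w) / 6
            for s, p, q, r, w in zip(state, k1, k2, k3, k4)]

def predator_prey(a, b, c):
    """Right sides F1 = a*x1 - b*x1*x2 and F2 = b*x1*x2 - c*x2."""
    def f(state):
        x1, x2 = state
        return [a * x1 - b * x1 * x2, b * x1 * x2 - c * x2]
    return f

def invariant(state, a, b, c):
    """H = b*(x1 + x2) - c*ln(x1) - a*ln(x2); dH/dt = 0 along solutions."""
    x1, x2 = state
    return b * (x1 + x2) - c * math.log(x1) - a * math.log(x2)

def max_invariant_drift(a=1.0, b=1.0, c=1.0, start=(1.0, 0.5),
                        t_end=20.0, h=1e-3):
    """Integrate from `start` and return (largest change in H, final state)."""
    f = predator_prey(a, b, c)
    state = list(start)
    h0 = invariant(state, a, b, c)
    drift = 0.0
    for _ in range(int(round(t_end / h))):
        state = rk4_step(f, state, h)
        drift = max(drift, abs(invariant(state, a, b, c) - h0))
    return drift, state

if __name__ == "__main__":
    print(max_invariant_drift())
```

Because $H$ stays fixed, the phase point is confined to a level curve of $H$ in the first quadrant, which is consistent with the closed paths described above.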

Phase paths of autonomous systems 

The problem of finding the phase paths is much easier when the system is autonomous. 
The method is as follows: 


* In the general case with $n$ unknowns, the phase space is $n$-dimensional.
† Also called an orbit of the system.

FIGURE 8.2 Phase diagram for the system $dx_1/dt = x_2 - 1$, $dx_2/dt = -x_1 + 2$. The
point $E(2, 1)$ is an equilibrium point of the system.

Example 8.1 Finding phase paths for an autonomous system 


Sketch the phase diagram for the autonomous system of equations

$$\frac{dx_1}{dt} = x_2 - 1, \qquad \frac{dx_2}{dt} = -x_1 + 2.$$

Solution

The phase paths of an autonomous system can be found by eliminating the time
derivatives. The path gradient is given by

$$\frac{dx_2}{dx_1} = \frac{dx_2/dt}{dx_1/dt} = -\frac{x_1 - 2}{x_2 - 1}$$

and this is a first order separable ODE satisfied by the phase paths. The general
solution of this equation is

$$(x_1 - 2)^2 + (x_2 - 1)^2 = C$$

and each (positive) choice for the constant of integration $C$ corresponds to a phase
path. The phase paths are therefore circles with centre (2, 1); the phase diagram is
shown in Figure 8.2.

The direction in which the phase point progresses along a path can be deduced
by examining the signs of the right sides in equations (8.18). This gives the signs of
$\dot{x}_1$ and $\dot{x}_2$ and hence the direction of motion of the phase point. ■
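Both conclusions of Example 8.1, the circular paths and the sense of motion, can be confirmed by integrating the system directly. This is a minimal sketch with hypothetical function names and an arbitrarily chosen starting point (3, 1), which lies on the circle $C = 1$.

```python
def rk4_step(f, state, h):
    """One classical RK4 step for the autonomous system state' = f(state)."""
    k1 = f(state)
    k2 = f([s + 0.5 * h * k for s, k in zip(state, k1)])
    k3 = f([s + 0.5 * h * k for s, k in zip(state, k2)])
    k4 = f([s + h * k for s, k in zip(state, k3)])
    return [s + h * (p + 2 * q + 2 * r + w) / 6
            for s, p, q, r, w in zip(state, k1, k2, k3, k4)]

def example_8_1(state):
    """Right sides of dx1/dt = x2 - 1, dx2/dt = -x1 + 2."""
    x1, x2 = state
    return [x2 - 1.0, -x1 + 2.0]

def squared_distance(state):
    """(x1 - 2)**2 + (x2 - 1)**2, the constant C on each phase path."""
    x1, x2 = state
    return (x1 - 2.0) ** 2 + (x2 - 1.0) ** 2

def trace_path(start, t_end=10.0, h=1e-3):
    """Return the list of phase points visited, starting from `start`."""
    state = list(start)
    path = [state]
    for _ in range(int(round(t_end / h))):
        state = rk4_step(example_8_1, state, h)
        path.append(state)
    return path

if __name__ == "__main__":
    path = trace_path((3.0, 1.0))
    print(max(abs(squared_distance(p) - 1.0) for p in path))
    print(path[1])  # x2 has dropped below 1: the point moves clockwise
```

At (3, 1) the right sides give $\dot{x}_1 = 0$ and $\dot{x}_2 = -1$, so the phase point moves downwards on the right of the centre, which is the clockwise sense.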

When the system is autonomous, one can say quite a lot about the general nature of 
the phase paths without finding them. The basic result is as follows: 

Theorem 8.1 Autonomous systems: a basic result Each point of the phase space 
of an autonomous system has exactly one phase path passing through it. 

Proof. Let $(a, b)$ be any point of the phase space. Suppose that the motion of the phase point $(x_1, x_2)$
satisfies the equations (8.18) and that the phase point is at $(a, b)$ when $t = 0$. The general theory of ODEs
then tells us that a solution of the equations (8.18), that satisfies the initial conditions $x_1 = a$, $x_2 = b$
when $t = 0$, exists and is unique. Let this solution be $\{X_1(t), X_2(t)\}$, which we will suppose is defined
for all $t$, both positive and negative. This phase path certainly passes through the point $(a, b)$ and we must
now show that there is no other. Suppose then that there is another solution of the equations in which the
phase point is at $(a, b)$ when $t = \tau$, say. This motion also exists and is uniquely determined and, in the
general case, would not be related to $\{X_1(t), X_2(t)\}$. However, for autonomous systems, the right sides of
equations (8.18) are independent of $t$ so that the two motions differ only by a shift in the origin of time. To
be precise, the new motion is simply $\{X_1(t - \tau), X_2(t - \tau)\}$. Thus, although the two motions are distinct,
the two phase points travel along the same path with the second point delayed relative to the first by the
constant time $\tau$. Hence, although there are infinitely many motions of the phase point that pass through the
point $(a, b)$, they all follow the same path. This proves the theorem. ■

Some important deductions follow from this basic result. 


Phase paths of autonomous systems 

• Distinct phase paths of an autonomous system do not cross or touch each other. 

• Periodic motions of an autonomous system correspond to phase paths that are 
simple* closed loops. 


Figure 8.2 shows the phase paths of an autonomous system. For this system, all of the
phase paths are simple closed loops and so every motion is periodic. An exception occurs
if the phase point is started from the point (2, 1). In this case the system has the constant
solution $x_1 = 2$, $x_2 = 1$ so that the phase point never moves; for this reason, the point
(2, 1) is called an equilibrium point of the system. In this case, the ‘path’ of the phase
point consists of the single point (2, 1). However, this still qualifies as a path and the
above theory still applies. Consequently no ‘real’ path may pass through an equilibrium
point of an autonomous system.† ■


* A simple curve is one that does not cross (or touch) itself (except possibly to close).

† It may appear from diagrams that phase paths can pass through equilibrium points. This is not so. Such
a path approaches arbitrarily close to the equilibrium point in question, but never reaches it!


8.3 THE PHASE PLANE IN DYNAMICS (($x$, $v$)-plane)

The above theory seems unconnected to classical mechanics since dynamical
equations of motion are second order ODEs. However, any second order ODE can be
expressed as a pair of first order ODEs. For example, consider the general linear oscillator
equation

$$\frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = F(t). \tag{8.19}$$

If we introduce the new variable $v = dx/dt$, then

$$\frac{dv}{dt} + 2Kv + \Omega^2 x = F(t).$$

It follows that the second order equation (8.19) is equivalent to the pair of first order
equations

$$\frac{dx}{dt} = v, \qquad \frac{dv}{dt} = F(t) - 2Kv - \Omega^2 x.$$

We may now apply the theory we have developed to this system of first order ODEs,
where the phase plane is now the ($x$, $v$)-plane. It is clear that driven motion leads to
a non-autonomous system because of the presence of the explicit time dependence of
$F(t)$; undriven motion (in which $F(t) = 0$) leads to an autonomous system. It is also
clear that equilibrium points in the ($x$, $v$)-plane lie on the $x$-axis and correspond to the
ordinary equilibrium positions of the particle.

FIGURE 8.3 Typical phase paths for the simple harmonic oscillator equation. Left: No damping.
Centre: Sub-critical damping. Right: Super-critical damping.

The form of the phase paths for the undriven SHO equation

$$\frac{d^2x}{dt^2} + 2K\frac{dx}{dt} + \Omega^2 x = 0$$

depends on the parameters $K$ and $\Omega$. We could find these paths by the method used in
Example 8.1, but there is no point in doing so since we have already solved the equation
explicitly in Chapter 5. For instance, when $K = 0$, the general solution is given by

$$x = C\cos(\Omega t - \gamma),$$

from which it follows that

$$v = \frac{dx}{dt} = -C\Omega\sin(\Omega t - \gamma).$$

The phase paths in the ($x$, $v$)-plane are therefore similar ellipses centred on the origin,
which is an equilibrium point. This, and two typical cases of damped motion, are shown
in Figure 8.3. In the presence of damping, the phase point tends to the equilibrium point
at the origin as $t \to \infty$. Although the equilibrium point is never actually reached, it is
convenient to say that these paths ‘terminate’ at the origin.
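The contrast between the undamped ellipses and the damped paths that ‘terminate’ at the origin can be illustrated by integrating the first order pair for the undriven oscillator. The sketch below is an illustration only; the function names, the parameter values and the diagnostic quantity $v^2 + \Omega^2 x^2$ are assumptions introduced here. That quantity is constant on each undamped ellipse, and along a damped path its rate of change is $-4Kv^2 \le 0$.

```python
def rk4_step(f, state, h):
    """One classical RK4 step for the autonomous system state' = f(state)."""
    k1 = f(state)
    k2 = f([s + 0.5 * h * k for s, k in zip(state, k1)])
    k3 = f([s + 0.5 * h * k for s, k in zip(state, k2)])
    k4 = f([s + h * k for s, k in zip(state, k3)])
    return [s + h * (p + 2 * q + 2 * r + w) / 6
            for s, p, q, r, w in zip(state, k1, k2, k3, k4)]

def oscillator(K, Omega):
    """Right sides of dx/dt = v, dv/dt = -2*K*v - Omega**2*x (undriven case)."""
    def f(state):
        x, v = state
        return [v, -2.0 * K * v - Omega ** 2 * x]
    return f

def ellipse_level(state, Omega):
    """v**2 + Omega**2*x**2, which labels the undamped elliptical paths."""
    x, v = state
    return v * v + Omega ** 2 * x * x

def level_history(K, Omega=1.0, start=(1.0, 0.0), t_end=40.0, h=1e-3):
    """Values of ellipse_level along the phase path starting at `start`."""
    f = oscillator(K, Omega)
    state = list(start)
    levels = [ellipse_level(state, Omega)]
    for _ in range(int(round(t_end / h))):
        state = rk4_step(f, state, h)
        levels.append(ellipse_level(state, Omega))
    return levels

if __name__ == "__main__":
    print(level_history(0.0)[-1])   # undamped: stays on the starting ellipse
    print(level_history(0.1)[-1])   # damped: the path has shrunk towards (0, 0)
```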
FIGURE 8.4 The phase diagram for the undamped Duffing equation with a
softening spring.


Example 8.2 Phase diagram for equation $d^2x/dt^2 + \Omega^2 x + \Lambda x^3 = 0$

Sketch the phase diagram for the non-linear oscillation equation

$$\frac{d^2x}{dt^2} + \Omega^2 x + \Lambda x^3 = 0,$$

when $\Lambda < 0$ (the softening spring).

Solution

This equation is equivalent to the pair of first order equations

$$\frac{dx}{dt} = v, \qquad \frac{dv}{dt} = -\Omega^2 x - \Lambda x^3,$$

which is an autonomous system. The phase paths satisfy the equation

$$\frac{dv}{dx} = -\frac{\Omega^2 x + \Lambda x^3}{v},$$

which is a first order separable ODE whose general solution is

$$v^2 = C - \Omega^2 x^2 - \tfrac12 \Lambda x^4,$$

where $C$ is a constant of integration. Each positive value of $C$ corresponds to a phase
path. The phase diagram for the case $\Lambda < 0$ is shown in Figure 8.4. There are three
equilibrium points at $(0, 0)$, $(\pm\Omega/|\Lambda|^{1/2}, 0)$. The closed loops around the origin
correspond to periodic oscillations of the particle about $x = 0$. Such oscillations can
therefore exist for any amplitude less than $\Omega/|\Lambda|^{1/2}$; this confirms the prediction of
the energy argument used earlier. Outside this region of closed loops, the paths are
unbounded and correspond to unbounded motions of the particle. These two regions
of differing behaviour are separated by the dashed paths (known as separatrices) that
‘terminate’ at the equilibrium points $(\pm\Omega/|\Lambda|^{1/2}, 0)$. ■
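The two kinds of path in Example 8.2 can be reproduced numerically. The sketch below is an illustration under assumed values $\Omega = 1$, $\Lambda = -1$ (so that the outer equilibrium points are at $x = \pm 1$), with hypothetical function names and an arbitrary escape threshold $|x| > 10$. It checks that $v^2 + \Omega^2 x^2 + \tfrac12 \Lambda x^4$ keeps its starting value along a path, that a start at rest inside the equilibrium points stays bounded, and that a start at rest outside them runs away.

```python
def rk4_step(f, state, h):
    """One classical RK4 step for the autonomous system state' = f(state)."""
    k1 = f(state)
    k2 = f([s + 0.5 * h * k for s, k in zip(state, k1)])
    k3 = f([s + 0.5 * h * k for s, k in zip(state, k2)])
    k4 = f([s + h * k for s, k in zip(state, k3)])
    return [s + h * (p + 2 * q + 2 * r + w) / 6
            for s, p, q, r, w in zip(state, k1, k2, k3, k4)]

def duffing(Omega, Lam):
    """Right sides of dx/dt = v, dv/dt = -Omega**2*x - Lam*x**3."""
    def f(state):
        x, v = state
        return [v, -Omega ** 2 * x - Lam * x ** 3]
    return f

def path_constant(state, Omega, Lam):
    """C = v**2 + Omega**2*x**2 + Lam*x**4/2, constant on each phase path."""
    x, v = state
    return v * v + Omega ** 2 * x * x + 0.5 * Lam * x ** 4

def follow(start, Omega=1.0, Lam=-1.0, t_end=50.0, h=1e-3, x_escape=10.0):
    """Return (escaped, largest |x| reached, largest drift in C)."""
    f = duffing(Omega, Lam)
    state = list(start)
    c0 = path_constant(state, Omega, Lam)
    max_x, drift = abs(state[0]), 0.0
    for _ in range(int(round(t_end / h))):
        state = rk4_step(f, state, h)
        max_x = max(max_x, abs(state[0]))
        if max_x > x_escape:
            return True, max_x, drift  # stop before the solution blows up
        drift = max(drift, abs(path_constant(state, Omega, Lam) - c0))
    return False, max_x, drift

if __name__ == "__main__":
    print(follow((0.5, 0.0)))  # closed loop about the origin
    print(follow((1.5, 0.0)))  # unbounded path outside the separatrices
```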
FIGURE 8.5 The Poincaré-Bendixson theorem. Any bounded phase path of a plane
autonomous system must either close itself (top left), terminate at an equilibrium
point (top right), or tend to a limit cycle (normal case bottom left, degenerate case
bottom right).


8.4 POINCARÉ-BENDIXSON THEOREM: LIMIT CYCLES

In the autonomous systems we have studied so far, those phase paths that are
bounded either (i) form a closed loop (corresponding to periodic motion), or (ii) ‘terminate’
at an equilibrium point (so that the motion dies away). Figure 8.3 shows examples
of this. The famous Poincaré-Bendixson theorem,* which is stated below, says that there
is just one further possibility.


Poincaré-Bendixson theorem

Suppose that a phase path of a plane autonomous system lies in a bounded
domain of the phase plane for $t > 0$. Then the path must either

• close itself, or

• terminate at an equilibrium point as $t \to \infty$, or

• tend to a limit cycle (or a degenerate limit cycle) as $t \to \infty$.


A proper proof of the theorem is long and difficult (see Coddington & Levinson [9]).


* After Jules Henri Poincaré (1854-1912) and Ivar Otto Bendixson (1861-1935). The theorem was first
proved by Poincaré but a more rigorous proof was given later by Bendixson.

The third possibility is new and needs explanation. A limit cycle is a periodic motion
of a special kind. It is isolated in the sense that nearby phase paths are not closed but are
attracted towards the limit cycle*; they spiral around it (or inside it) getting ever closer, as
shown in Figure 8.5 (bottom left). The degenerate limit cycle shown in Figure 8.5 (bottom
right) is an obscure case in which the limiting curve is not a periodic motion but has one
or more equilibrium points actually on it. This case is often omitted in the literature, but
it definitely exists!

Proving the existence of periodic solutions 

The Poincaré-Bendixson theorem provides a way of proving that a plane autonomous
system has a periodic solution even when that solution cannot be found explicitly. If a
phase path can be found that cannot escape from some bounded domain $\mathcal{D}$ of the phase
plane, and if $\mathcal{D}$ contains no equilibrium points, then Poincaré-Bendixson implies that the
phase path must either be a closed loop or tend to a limit cycle. In either case, the system
must have a periodic solution lying in $\mathcal{D}$. The method is illustrated by the following
examples.

Example 8.3 Proving existence of a limit cycle 

Prove that the autonomous system of ODEs

$$\dot{x} = x - y - (x^2 + y^2)x, \qquad \dot{y} = x + y - (x^2 + y^2)y,$$

has a limit cycle.

Solution

This system clearly has an equilibrium point at the origin $x = y = 0$, and a little
algebra shows that there are no others. Although we have not proved this result, it
is true that any periodic solution (simple closed loop) in the phase plane must have
an equilibrium point lying inside it. In the present case, it follows that, if a periodic
solution exists, then it must enclose the origin. This suggests taking the domain $\mathcal{D}$ to
be the annular region between two circles centred on the origin.

It is convenient to express the system of equations in polar coordinates $r$, $\theta$. The
transformed equations are (see Problem 8.5)

$$\dot{r} = \frac{x_1\dot{x}_1 + x_2\dot{x}_2}{r}, \qquad \dot{\theta} = \frac{x_1\dot{x}_2 - x_2\dot{x}_1}{r^2},$$

where $x_1 = r\cos\theta$ and $x_2 = r\sin\theta$. In the present case, the polar equations take the
simple form

$$\dot{r} = r(1 - r^2), \qquad \dot{\theta} = 1.$$

These equations can actually be solved explicitly, but, in order to illustrate the method,
we will make no use of this fact. Let $\mathcal{D}$ be the annular domain $a < r < b$, where
$0 < a < 1$ and $b > 1$. On the circle $r = b$, $\dot{r} = b(1 - b^2) < 0$. Thus a phase point
that starts anywhere on the outer boundary $r = b$ enters the domain $\mathcal{D}$. Similarly, on
the circle $r = a$, $\dot{r} = a(1 - a^2) > 0$ and so a phase point that starts anywhere on the
inner boundary $r = a$ also enters the domain $\mathcal{D}$. It follows that any phase path that
starts in the annular domain $\mathcal{D}$ can never leave. Since $\mathcal{D}$ is a bounded domain with no
equilibrium points within it or on its boundaries, it follows from Poincaré-Bendixson
that any such path must either be a simple closed loop or tend to a limit cycle. In
either case, the system must have a periodic solution lying in the annulus $a < r < b$.

* This actually describes a stable limit cycle, which is the only kind likely to be observed.

We can say more. Phase paths that begin on either boundary of $\mathcal{D}$ enter $\mathcal{D}$ and
can never leave. These phase paths cannot close themselves (that would mean leaving
$\mathcal{D}$) and so can only tend to a limit cycle. It follows that the system must have (at least
one) limit cycle lying in the domain $\mathcal{D}$. [The explicit solution shows that the circle
$r = 1$ is a limit cycle and that there are no other periodic solutions.] ■
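The trapping argument of Example 8.3 can be illustrated by following two phase paths numerically, one starting inside the annulus region near the origin and one starting well outside. This is a sketch with hypothetical function names and arbitrarily chosen starting radii 0.1 and 3; it illustrates the conclusion but proves nothing.

```python
import math

def rk4_step(f, state, h):
    """One classical RK4 step for the autonomous system state' = f(state)."""
    k1 = f(state)
    k2 = f([s + 0.5 * h * k for s, k in zip(state, k1)])
    k3 = f([s + 0.5 * h * k for s, k in zip(state, k2)])
    k4 = f([s + h * k for s, k in zip(state, k3)])
    return [s + h * (p + 2 * q + 2 * r + w) / 6
            for s, p, q, r, w in zip(state, k1, k2, k3, k4)]

def example_8_3(state):
    """Right sides of x' = x - y - (x^2 + y^2)x, y' = x + y - (x^2 + y^2)y."""
    x, y = state
    r2 = x * x + y * y
    return [x - y - r2 * x, x + y - r2 * y]

def final_radius(r0, t_end=20.0, h=1e-3):
    """Distance from the origin at time t_end for a path started at (r0, 0)."""
    state = [r0, 0.0]
    for _ in range(int(round(t_end / h))):
        state = rk4_step(example_8_3, state, h)
    return math.hypot(state[0], state[1])

if __name__ == "__main__":
    print(final_radius(0.1))  # path spirals outwards towards r = 1
    print(final_radius(3.0))  # path spirals inwards towards r = 1
```

Both paths settle onto the circle $r = 1$, in agreement with the explicit solution mentioned at the end of the example.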

Not all examples are as straightforward as the last one. Often, considerable ingenuity
has to be used to find a suitable domain $\mathcal{D}$. In particular, the boundary of $\mathcal{D}$ cannot always
be composed of circles. Most readers will find our second example rather difficult!

Example 8.4 Rayleigh’s equation has a limit cycle 

Show that Rayleigh’s equation

$$\ddot{x} + \epsilon\dot{x}\left(\dot{x}^2 - 1\right) + x = 0,$$

has a limit cycle for any positive value of the parameter $\epsilon$.

Solution

Rayleigh’s equation arose in his theory of the bowing of a violin string. In the context
of particle oscillations however, it corresponds to a simple harmonic oscillator with a
strange damping term. When $|\dot{x}| > 1$, we have ordinary (positive) damping and the
motion decays. However, when $|\dot{x}| < 1$, we have negative damping and the motion
grows. The possibility arises then of a periodic motion which is positively damped on
some parts of its cycle and negatively damped on others. Somewhat surprisingly, this
actually exists.

Rayleigh’s equation is equivalent to the autonomous system of ODEs

$$\dot{x} = v, \qquad \dot{v} = -x - \epsilon v\left(v^2 - 1\right), \tag{8.20}$$

for which the only equilibrium position is at $x = v = 0$. It follows that, if there is a
periodic solution, then it must enclose the origin. At first, we proceed as in the first
example. In polar form, the equations (8.20) become

$$\dot{r} = -\epsilon r\sin^2\theta\left(r^2\sin^2\theta - 1\right), \qquad \dot{\theta} = -1 - \epsilon\sin\theta\cos\theta\left(r^2\sin^2\theta - 1\right). \tag{8.21}$$

FIGURE 8.6 A suitable domain $\mathcal{D}$ to show that Rayleigh’s
equation has a limit cycle.

Let $r = c$ be a circle with centre at the origin and radius less than unity. Then $\dot{r} > 0$
everywhere on $r = c$ except at the two points $x = \pm c$, $v = 0$, where it is zero.
Hence, except for these two points, we can deduce that a phase point that starts on the
circle $r = c$ enters the domain $r > c$. Fortunately, these exceptional points can be
disregarded. It does not matter if there are a finite number of points on $r = c$ where
the phase paths go the ‘wrong’ way, since this provides only a finite number of escape
routes! The circle $r = c$ thus provides a suitable inner boundary $C_1$ of the domain $\mathcal{D}$.

Sadly, one cannot simply take a large circle to be the outer boundary of $\mathcal{D}$ since
$\dot{r}$ has the wrong sign on those segments of the circle that lie in the strip $-1 < v < 1$.
This allows any number of phase paths to escape and so invalidates our argument.
However, this does not prevent us from choosing a boundary of a different shape. A
suitable outer boundary for $\mathcal{D}$ is the contour $C_2$ shown in Figure 8.6. This contour is
made up from four segments. The first segment $AB$ is part of an actual phase path
of the system which starts at $A(-a, 1)$ and continues as far as $B(b, 1)$. The form
of this phase path can be deduced from equations (8.20) and (8.21). When $v\,(= r\sin\theta) > 1$,
$\dot{r} < 0$ and $\dot{x} = v > 0$, so that the phase point travels from left to right across the upper
half-plane, that is, clockwise around the origin, with $r$ decreasing. In particular, $B$ must
be closer to the origin than $A$ so that $b < a$, as
shown. Similarly, the segment $A'B'$ is part of a second actual phase path that begins at
$A'(a, -1)$. Because of the symmetry of the equations (8.20) under the transformation
$x \to -x$, $v \to -v$, this segment is just the reflection of the segment $AB$ in the origin;
the point $B'$ is therefore $(-b, -1)$. The contour is closed by inserting the straight line
segments $BA'$ and $B'A$.

We will now show that, when $C_2$ is made sufficiently large, it is a suitable outer
boundary for our domain $\mathcal{D}$. Consider first the segment $AB$. Since this is a phase
path, no other phase path may cross it (in either direction); the same applies to the
segment $A'B'$. Now consider the straight segment $BA'$. Because $a > b$, the outward
unit normal $\boldsymbol{n}$ shown in Figure 8.6 makes a positive acute angle $\alpha$ with the axis $Ox$.
Now the ‘phase plane velocity’ of a phase point is

$$\dot{x}\,\boldsymbol{i} + \dot{v}\,\boldsymbol{j} = v\,\boldsymbol{i} - \left(\epsilon v(v^2 - 1) + x\right)\boldsymbol{j}$$

and the component of this ‘velocity’ in the $\boldsymbol{n}$-direction is therefore

$$\begin{aligned}
\left(v\,\boldsymbol{i} - \left(\epsilon v(v^2 - 1) + x\right)\boldsymbol{j}\right)\cdot\left(\cos\alpha\,\boldsymbol{i} + \sin\alpha\,\boldsymbol{j}\right)
&= v\cos\alpha - \sin\alpha\left(\epsilon v(v^2 - 1) + x\right) \\
&= -x\sin\alpha + v\left(\cos\alpha + \epsilon\sin\alpha\,(1 - v^2)\right) \\
&< -b\sin\alpha + (1 + \epsilon),
\end{aligned}$$

for $(x, v)$ on $BA'$. We wish to say that this expression is negative so that phase points
that begin on $BA'$ enter the domain $\mathcal{D}$. This is true if the contour $C_2$ is made large
enough. If we let $a$ tend to infinity, then $b$ also tends to infinity and $\alpha$ tends to $\pi/2$.
It follows that, whatever the value of the parameter $\epsilon$, we can make $b\sin\alpha > (1 + \epsilon)$
by taking $a$ large enough. A similar argument applies to the segment $B'A$. Thus
the contour $C_2$ is a suitable outer boundary for the domain $\mathcal{D}$. It follows that any
phase path that starts in the domain $\mathcal{D}$ enclosed by $C_1$ and $C_2$ can never leave. Since
$\mathcal{D}$ is a bounded domain with no equilibrium points within it or on its boundaries, it
follows from Poincaré-Bendixson that any such path must either be a simple closed
loop or tend to a limit cycle. In either case, Rayleigh’s equation must have a periodic
solution lying in $\mathcal{D}$.

FIGURE 8.7 The body is supported by a rough moving belt and is attached to a fixed
post by a light spring.

We can say more. Phase paths that begin on either of the straight segments of
the outer boundary $C_2$ enter V and can never leave. These phase paths cannot close
themselves (that would mean leaving V) and so can only tend to a limit cycle. It
follows that Rayleigh’s equation must have (at least one) limit cycle lying in the
domain V. [There is in fact only one.] ■
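The existence argument above can be illustrated numerically. The following is a minimal sketch, not part of the proof: it assumes Rayleigh's equation in the form $\ddot{x} + \epsilon\dot{x}(\dot{x}^2 - 1) + x = 0$ (the form implied by the phase plane velocity used above), takes $\epsilon = 1$ as an arbitrary illustrative value, and uses a hand-written Runge-Kutta step. The function names are our own. One phase path is started close to the origin and one well outside; if both tend to the same limit cycle, the largest value of $|v|$ at late times should be the same for each.

```python
def rayleigh_rhs(x, v, eps):
    """Phase-plane velocity (dx/dt, dv/dt) for x'' + eps*x'*(x'**2 - 1) + x = 0."""
    return v, -x - eps * v * (v * v - 1.0)


def rk4_step(x, v, eps, dt):
    """Advance (x, v) by one classical fourth-order Runge-Kutta step."""
    k1x, k1v = rayleigh_rhs(x, v, eps)
    k2x, k2v = rayleigh_rhs(x + 0.5 * dt * k1x, v + 0.5 * dt * k1v, eps)
    k3x, k3v = rayleigh_rhs(x + 0.5 * dt * k2x, v + 0.5 * dt * k2v, eps)
    k4x, k4v = rayleigh_rhs(x + dt * k3x, v + dt * k3v, eps)
    x += dt * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0
    v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
    return x, v


def late_time_speed_amplitude(x0, v0, eps=1.0, t_end=80.0, window=10.0, dt=0.01):
    """Largest |v| observed during the final `window` time units of the motion."""
    steps = int(round(t_end / dt))
    first_sampled = steps - int(round(window / dt))
    x, v, amplitude = x0, v0, 0.0
    for n in range(steps):
        x, v = rk4_step(x, v, eps, dt)
        if n >= first_sampled:
            amplitude = max(amplitude, abs(v))
    return amplitude


inner = late_time_speed_amplitude(0.1, 0.0)  # starts close to the origin
outer = late_time_speed_amplitude(3.0, 0.0)  # starts well outside
print(inner, outer)
```

The two printed amplitudes should agree closely, which is what one expects if both paths have settled on the same closed curve. This is only numerical evidence; the proof is the Poincaré-Bendixson argument given above.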

A realistic mechanical system with a limit cycle 

Finding realistic mechanical systems that exhibit limit cycles is not easy. Driven oscillations
are eliminated by the requirement that the system be autonomous. Undamped oscillators
have bounded periodic motions, and the introduction of damping causes the motions
to die away to zero, not to a limit cycle. In order to keep the motion going, the system
needs to be negatively damped for part of the time. This is an unphysical requirement, but
it can be simulated in a physically realistic system as follows.

Consider the system shown in Figure 8.7. A block of mass $M$ is supported by a rough
horizontal belt and is attached to a fixed post by a light linear spring. The belt is made
to move with constant speed $V$. Suppose that the motion takes place in a straight line
and that $x(t)$ is the extension of the spring beyond its natural length at time $t$. Then the
equation of motion of the block is

\[
M\frac{dv}{dt} = -M\Omega^2 x - F(v - V),
\]

where $v = dx/dt$, $M\Omega^2$ is the spring constant, and $F(v)$ is the frictional force that the
belt would exert on the block if the block had velocity $v$ and the belt were at rest; in the
actual situation, the argument $v$ is replaced by the relative velocity $v - V$. The function
$F(v)$ is supposed to have the form shown in Figure 8.8 (left). Although this choice is
unusual ($F(v)$ is not an increasing function of $v$ for all $v$), it is not unphysical!

FIGURE 8.8 Left: The form of the frictional resistance function $F(v)$. Right: The limit cycle
in the phase plane; $E$ is the unstable equilibrium point.

Under the above conditions, the block has an equilibrium position at
$x = F(V)/(M\Omega^2)$. The linearised equation for small motions near this equilibrium position
is given by

\[
M\frac{d^2x'}{dt^2} = -M\Omega^2 x' - F'(V)\frac{dx'}{dt},
\]

where $x'$ is the displacement of the block from the equilibrium position. If we select the
belt velocity $V$ so that $F'(V)$ is negative (as shown in Figure 8.8 (left)), then the effective
damping is negative and small motions will grow. The equilibrium position is therefore
unstable; oscillations of the block about the equilibrium position then do not die out, but
instead tend to a limit cycle. This limit cycle is shown in Figure 8.8 (right). The formal
proof that such a limit cycle exists is similar to that for Rayleigh’s equation. Indeed, this
system is essentially Rayleigh’s model for the bowing of a violin string, where the belt is
the bow, and the block is the string.
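The claim that small motions grow when $F'(V)$ is negative can be checked directly from the linearised equation. The sketch below seeks solutions proportional to $e^{\lambda t}$, so that $\lambda$ satisfies $M\lambda^2 + F'(V)\lambda + M\Omega^2 = 0$, and inspects the real parts of the two roots. The numerical values and the name `linearised_roots` are illustrative assumptions only; no particular friction law is implied.

```python
import cmath


def linearised_roots(M, Omega, dF):
    """Roots of M*lam**2 + dF*lam + M*Omega**2 = 0, where dF stands for F'(V)."""
    disc = cmath.sqrt(dF * dF - 4.0 * M * M * Omega * Omega)
    return (-dF + disc) / (2.0 * M), (-dF - disc) / (2.0 * M)


# Negative slope F'(V) < 0: effective negative damping, so motions grow.
growing = linearised_roots(M=1.0, Omega=2.0, dF=-0.3)

# Positive slope F'(V) > 0: ordinary damping, so motions decay.
decaying = linearised_roots(M=1.0, Omega=2.0, dF=0.3)

print(growing, decaying)
```

A positive real part means that the displacement $x'$ grows exponentially, which is the instability described above. The linearised equation says nothing about where the growth stops; that is decided by the non-linear shape of $F(v)$.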

Chaotic motions 

Another important conclusion from Poincaré-Bendixson is that no bounded motion of a
plane autonomous system can exhibit chaos. The phase point cannot just wander about
in a bounded region of the phase plane for ever. It must either close itself, terminate at
an equilibrium point, or tend to a limit cycle and none of these motions is chaotic. In
particular, no bounded motion of an undriven non-linear oscillator can be chaotic. As we
will see in the next section however, the driven non-linear oscillator (a non-autonomous
system) can exhibit bounded chaotic motions.

It should be remembered that Poincaré-Bendixson applies only to the bounded motion
of plane autonomous systems. If the phase space has dimension three or more, then other
motions, including chaos, are possible.


8.5 DRIVEN NON-LINEAR OSCILLATIONS 


Suppose that we now introduce damping and a harmonic driving force into equation (8.2).
This gives

\[
\frac{d^2x}{dt^2} + k\frac{dx}{dt} + \Omega^2 x + \Lambda x^3 = F_0\cos pt, \tag{8.22}
\]

which is known as Duffing’s equation.

The presence of the driving force $F_0\cos pt$ makes this system non-autonomous.
The behaviour of non-autonomous systems is considerably more complex than that of
autonomous systems. Phase space is still a useful aid in depicting the motion of the system,
but little can be said about the general behaviour of the phase paths. In particular,
phase paths can cross each other any number of times, and Poincaré-Bendixson does not
apply. Our treatment of driven non-linear oscillations is therefore restricted to perturbation
theory.

In view of the large number of parameters, it is sensible to non-dimensionalise equation (8.22).
The dimensionless displacement $X$ is defined by $x = (F_0/p^2)X$ and the
dimensionless time $s$ by $s = pt$. The function $X(s)$ then satisfies the dimensionless equation

\[
X'' + \left(\frac{k}{p}\right)X' + \left(\frac{\Omega}{p}\right)^2 X + \epsilon X^3 = \cos s, \tag{8.23}
\]

where the dimensionless parameter $\epsilon$ is defined by

\[
\epsilon = \frac{F_0^2\Lambda}{p^6}. \tag{8.24}
\]

When $\epsilon = 0$, equation (8.23) reduces to the linear problem. This suggests that, when
$\epsilon$ is small, we may be able to find approximate solutions by perturbation theory. The
linear problem always has a periodic solution for $X$ (the driven motion) that is harmonic
with period $2\pi$. Proving the existence of periodic solutions of Duffing’s equation is an
interesting and difficult problem. Here we address this problem for the case in which $\epsilon$
is small, a regular perturbation on the linear problem. To simplify the working we will
suppose that damping is absent; the general features of the solution remain the same. The
governing equation (8.23) then simplifies to

\[
X'' + \left(\frac{\Omega}{p}\right)^2 X + \epsilon X^3 = \cos s. \tag{8.25}
\]

Initial conditions do not come into this problem. We are simply seeking a family of
solutions $X(s, \epsilon)$, parametrised by $\epsilon$, that are (i) periodic, and (ii) reduce to the linear
solution when $\epsilon = 0$. We need to consider first the periodicity of this family of solutions.
In the non-linear problem, we have no right to suppose that the angular frequency of the
driven motion is equal to that of the driving force, as it is in the linear problem; it could
depend on $\epsilon$. However, suppose that the driving force has minimum period $\tau_0$ and that a
family of solutions $X(s, \epsilon)$ of equation (8.25) exists with minimum period $\tau$ $(= \tau(\epsilon))$.
Then, since the derivatives and powers of $X$ also have period $\tau$, it follows that the left
side of equation (8.25) must have period $\tau$. The right side however has period $\tau_0$ and this
is known to be the minimum period. It follows that $\tau$ must be an integer multiple of $\tau_0$;
note that $\tau$ is not compelled to be equal to $\tau_0$.* However, in the present case, the period
$\tau(\epsilon)$ is supposed to be a continuous function of $\epsilon$ with $\tau = \tau_0$ when $\epsilon = 0$. It follows
that the only possibility is that $\tau = \tau_0$ for all $\epsilon$. Thus the period of the driven motion
is independent of $\epsilon$ and is equal to the period of the driving force. This argument leaves
open the possibility that other driven motions may exist that have periods that are integer
multiples of $\tau_0$. However, even if they exist, they cannot occur in our perturbation scheme.

We therefore expand $X(s, \epsilon)$ in the perturbation series

\[
X(s, \epsilon) = X_0(s) + \epsilon X_1(s) + \epsilon^2 X_2(s) + \cdots, \tag{8.26}
\]

and seek a solution of equation (8.25) that has period $2\pi$. It follows that the expansion
functions $X_0(s)$, $X_1(s)$, $X_2(s)$, ... must also have period $2\pi$. If we now substitute this
series into the equation (8.25) and equate coefficients of powers of $\epsilon$, we obtain a succession
of ODEs the first two of which are as follows:

From coefficients of $\epsilon^0$:

\[
X_0'' + \left(\frac{\Omega}{p}\right)^2 X_0 = \cos s. \tag{8.27}
\]

From coefficients of $\epsilon^1$:

\[
X_1'' + \left(\frac{\Omega}{p}\right)^2 X_1 = -X_0^3. \tag{8.28}
\]
For $p \neq \Omega$, the general solution of the zero order equation (8.27) is

\[
X_0 = \left(\frac{p^2}{\Omega^2 - p^2}\right)\cos s + A\cos(\Omega s/p) + B\sin(\Omega s/p),
\]

where $A$ and $B$ are arbitrary constants. Since $X_0$ is known to have period $2\pi$, it follows
that $A$ and $B$ must be zero unless $\Omega$ is an integer multiple of $p$; we will assume this is not
the case. Then the required solution of the zero order equation is

\[
X_0 = \left(\frac{p^2}{\Omega^2 - p^2}\right)\cos s. \tag{8.29}
\]

* The fact that $\tau$ is the minimum period of $X$ does not necessarily make it the minimum period of the left
side of equation (8.25).


The first order equation (8.28) can now be written

\[
X_1'' + \left(\frac{\Omega}{p}\right)^2 X_1
= -\left(\frac{p^2}{\Omega^2 - p^2}\right)^3\cos^3 s
= -\frac{p^6}{4(\Omega^2 - p^2)^3}\,(3\cos s + \cos 3s), \tag{8.30}
\]

on using the trigonometric identity $\cos 3s = 4\cos^3 s - 3\cos s$. Since $\Omega/p$ is not an integer,
the only solution of this equation that has period $2\pi$ is

\[
X_1 = -\frac{p^8}{4(\Omega^2 - p^2)^3}\left(\frac{3\cos s}{\Omega^2 - p^2} + \frac{\cos 3s}{\Omega^2 - 9p^2}\right). \tag{8.31}
\]


Results 

When $\epsilon$ $(= F_0^2\Lambda/p^6)$ is small, the driven response of the Duffing equation (8.22) (with
$k = 0$) is given by

\[
x = \frac{F_0}{\Omega^2 - p^2}\left[\cos pt
- \left(\frac{3p^6\cos pt}{4(\Omega^2 - p^2)^3} + \frac{p^6\cos 3pt}{4(\Omega^2 - p^2)^2(\Omega^2 - 9p^2)}\right)\epsilon
+ O\!\left(\epsilon^2\right)\right].
\]

This is the approximate solution correct to the first order in the small parameter e. 
More terms can be obtained in a similar way but this is best done with computer assistance. 
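As a modest example of such computer assistance, the first order solution can be checked by substituting $X_0 + \epsilon X_1$ into equation (8.25) and measuring what is left over. If the zero and first order terms are correct, the residual should be of order $\epsilon^2$, so that making $\epsilon$ ten times larger should make the residual about one hundred times larger. The sketch below does this with the coefficients of $\cos s$ and $\cos 3s$ obtained from equations (8.29) and (8.30); the values $\Omega = 2.5$, $p = 1$ (chosen so that $\Omega/p$ is not an integer) and the function names are illustrative assumptions.

```python
import math


def first_order_response(s, eps, Omega, p):
    """Return X = X0 + eps*X1 and its second derivative X'' at dimensionless time s."""
    D = Omega**2 - p**2
    a0 = p**2 / D                                          # amplitude of X0
    c1 = -3.0 * p**8 / (4.0 * D**4)                        # cos s coefficient of X1
    c3 = -p**8 / (4.0 * D**3 * (Omega**2 - 9.0 * p**2))    # cos 3s coefficient of X1
    X = (a0 + eps * c1) * math.cos(s) + eps * c3 * math.cos(3.0 * s)
    Xpp = -(a0 + eps * c1) * math.cos(s) - 9.0 * eps * c3 * math.cos(3.0 * s)
    return X, Xpp


def max_residual(eps, Omega=2.5, p=1.0, samples=400):
    """Largest |X'' + (Omega/p)^2 X + eps X^3 - cos s| over one period of s."""
    worst = 0.0
    for n in range(samples):
        s = 2.0 * math.pi * n / samples
        X, Xpp = first_order_response(s, eps, Omega, p)
        residual = Xpp + (Omega / p) ** 2 * X + eps * X**3 - math.cos(s)
        worst = max(worst, abs(residual))
    return worst


r_small = max_residual(1e-3)
r_large = max_residual(1e-2)
print(r_small, r_large, r_large / r_small)
```

A ratio close to 100 is consistent with an error of order $\epsilon^2$. The check confirms only that the two-term series satisfies the equation to the stated order; it says nothing about the convergence of the full series.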

The most interesting feature of this formula is the behaviour of the first order correction
term when $\Omega$ is close to $3p$, which suggests the existence of a super-harmonic
resonance with frequency $3p$. Similar ‘resonances’ occur in the higher terms at the frequencies
$5p$, $7p$, ..., and are caused by the presence of the non-linear term $\Lambda x^3$. It should
not however be concluded that large amplitude responses occur at these frequencies.* The
critical case in which $\Omega = 3p$ is solved in Problem 8.14 and reveals no infinities in the
response.


* This is a subtle point. Like all power series, perturbation series have a certain ‘radius of convergence’.
When all the terms of the perturbation series are included, $\epsilon$ is restricted to some range of values
$-\epsilon_0 < \epsilon < \epsilon_0$. What seems to happen when $\Omega$ approaches $3p$ is that $\epsilon_0$ approaches zero so that
the first order correction term never actually gets large.


Sub-harmonic responses and chaos

We have so far left open the interesting question of whether a driving force with minimum
period $\tau$ can excite a subharmonic response, that is, a response whose minimum period is
an integer multiple of $\tau$. This is certainly not possible in the linear case, where the driving
force and the induced response always have the same period. One way of investigating
this problem would be to expand the (unknown) response $x(t)$ as a Fourier series and
to substitute this into the left side of Duffing’s equation. One would then require all the
odd numbered terms to magically cancel out leaving a function with period $2\tau$. Unlikely
though this may seem, it can happen! There are ranges of the parameters in Duffing’s
equation that permit a sub-harmonic response. Indeed, it is possible for the same set
of parameters to allow more than one periodic response. Figure 8.9 shows two different
periodic responses of the equation $d^2x/dt^2 + k\,dx/dt + x^3 = A\cos t$, each corresponding
to $k = 0.04$, $A = 0.9$. One response has period $2\pi$ while the other is a subharmonic
response with period $4\pi$. Which of these is the steady state response depends on the
initial conditions. It is also possible for the motion to be chaotic with no steady state ever
being reached, even though damping is present.

FIGURE 8.9 Two different periodic responses to the same driving force. Left: A
response of period $2\pi$. Right: A sub-harmonic response of period $4\pi$.
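One way to explore these possibilities numerically is to record the phase point once in every period of the driving force (a ‘stroboscopic’ sample). The sketch below does this for the equation $d^2x/dt^2 + k\,dx/dt + x^3 = A\cos t$ with $k = 0.04$ and $A = 0.9$, using a hand-written Runge-Kutta step. The initial conditions, step size and function names are arbitrary assumptions, and no claim is made here about which response a given starting point leads to.

```python
import math


def accel(t, x, v, k, A):
    """Acceleration given by x'' + k x' + x**3 = A cos t."""
    return A * math.cos(t) - k * v - x**3


def strobe(x0, v0, k=0.04, A=0.9, periods=100, steps_per_period=200):
    """Integrate with RK4 and record (x, v) once per driving period 2*pi."""
    dt = 2.0 * math.pi / steps_per_period
    t, x, v = 0.0, x0, v0
    samples = []
    for _ in range(periods):
        for _ in range(steps_per_period):
            k1x, k1v = v, accel(t, x, v, k, A)
            x2, v2 = x + 0.5 * dt * k1x, v + 0.5 * dt * k1v
            k2x, k2v = v2, accel(t + 0.5 * dt, x2, v2, k, A)
            x3, v3 = x + 0.5 * dt * k2x, v + 0.5 * dt * k2v
            k3x, k3v = v3, accel(t + 0.5 * dt, x3, v3, k, A)
            x4, v4 = x + dt * k3x, v + dt * k3v
            k4x, k4v = v4, accel(t + dt, x4, v4, k, A)
            x += dt * (k1x + 2.0 * k2x + 2.0 * k3x + k4x) / 6.0
            v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
            t += dt
        samples.append((x, v))
    return samples


points = strobe(x0=1.0, v0=0.0)
print(points[-4:])
```

Once transients have died away, a response of period $2\pi$ shows up as a single repeated sample point, a response of period $4\pi$ as two alternating points, and a chaotic response as a set of points that never settles. Because the damping is light, many driving periods may be needed before the pattern becomes clear.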


Problems on Chapter 8 

Answers and comments are at the end of the book. 
Harder problems carry a star (*). 


Periodic oscillations: Lindstedt’s method 

8.1 A non-linear oscillator satisfies the equation

\[
\left(1 + \epsilon x^2\right)\ddot{x} + x = 0,
\]

where $\epsilon$ is a small parameter. Use Lindstedt’s method to obtain a two-term approximation to
the oscillation frequency when the oscillation has unit amplitude. Find also the corresponding
two-term approximation to $x(t)$. [You will need the identity $4\cos^3 s = 3\cos s + \cos 3s$.]

8.2 A non-linear oscillator satisfies the equation

\[
\ddot{x} + x + \epsilon x^5 = 0,
\]

where $\epsilon$ is a small parameter. Use Lindstedt’s method to obtain a two-term approximation to
the oscillation frequency when the oscillation has unit amplitude. [You will need the identity
$16\cos^5 s = 10\cos s + 5\cos 3s + \cos 5s$.]

8.3 Unsymmetrical oscillations A non-linear oscillator satisfies the equation

\[
\ddot{x} + x + \epsilon x^2 = 0,
\]

where $\epsilon$ is a small parameter. Explain why the oscillations are unsymmetrical about $x = 0$ in
this problem.

Use Lindstedt’s method to obtain a two-term approximation to $x(t)$ for the oscillation in
which the maximum value of $x$ is unity. Deduce a two-term approximation to the minimum
value achieved by $x(t)$ in this oscillation.

8.4* A limit cycle by perturbation theory Use perturbation theory to investigate the limit
cycle of Rayleigh’s equation, taken here in the form

\[
\ddot{x} + \epsilon\left(\tfrac{1}{3}\dot{x}^2 - 1\right)\dot{x} + x = 0,
\]

where $\epsilon$ is a small positive parameter. Show that the zero order approximation to the limit
cycle is a circle and determine its centre and radius. Find the frequency of the limit cycle
correct to order $\epsilon^2$, and find the function $x(t)$ correct to order $\epsilon$.

Phase paths 

8.5 Phase paths in polar form Show that the system of equations

\[
\dot{x}_1 = F_1(x_1, x_2, t), \qquad \dot{x}_2 = F_2(x_1, x_2, t)
\]

can be written in polar coordinates in the form

\[
\dot{r} = \frac{x_1F_1 + x_2F_2}{r}, \qquad \dot{\theta} = \frac{x_1F_2 - x_2F_1}{r^2},
\]

where $x_1 = r\cos\theta$ and $x_2 = r\sin\theta$.

A dynamical system satisfies the equations

\[
\dot{x} = -x + y, \qquad \dot{y} = -x - y.
\]

Convert this system into polar form and find the polar equations of the phase paths. Show that
every phase path encircles the origin infinitely many times in the clockwise direction. Show
further that every phase path terminates at the origin. Sketch the phase diagram.

8.6 A dynamical system satisfies the equations

\[
\dot{x} = x - y - (x^2 + y^2)x, \qquad \dot{y} = x + y - (x^2 + y^2)y.
\]

Convert this system into polar form and find the polar equations of the phase paths that begin
in the domain $0 < r < 1$. Show that all these phase paths spiral anti-clockwise and tend to
the limit cycle $r = 1$. Show also that the same is true for phase paths that begin in the domain
$r > 1$. Sketch the phase diagram.

8.7 A damped linear oscillator satisfies the equation

\[
\ddot{x} + \dot{x} + x = 0.
\]

Show that the polar equations for the motion of the phase points are

\[
\dot{r} = -r\sin^2\theta, \qquad \dot{\theta} = -\left(1 + \tfrac{1}{2}\sin 2\theta\right).
\]

Show that every phase path encircles the origin infinitely many times in the clockwise direction.
Show further that these phase paths terminate at the origin.

8.8 A non-linear oscillator satisfies the equation

\[
\ddot{x} + \dot{x}^3 + x = 0.
\]

Find the polar equations for the motion of the phase points. Show that phase paths that begin
within the circle $r < 1$ encircle the origin infinitely many times in the clockwise direction.
Show further that these phase paths terminate at the origin.

8.9 A non-linear oscillator satisfies the equation

\[
\ddot{x} + \left(x^2 + \dot{x}^2 - 1\right)\dot{x} + x = 0.
\]

Find the polar equations for the motion of the phase points. Show that any phase path that
starts in the domain $1 < r < \sqrt{3}$ spirals clockwise and tends to the limit cycle $r = 1$. [The
same is true of phase paths that start in the domain $0 < r < 1$.] What is the period of the limit
cycle?

8.10 Predator-prey Consider the symmetrical predator-prey equations

\[
\dot{x} = x - xy, \qquad \dot{y} = xy - y,
\]

where $x(t)$ and $y(t)$ are positive functions. Show that the phase paths satisfy the equation

\[
\left(x e^{-x}\right)\left(y e^{-y}\right) = A,
\]

where $A$ is a constant whose value determines the particular phase path. By considering the
shape of the surface

\[
z = \left(x e^{-x}\right)\left(y e^{-y}\right),
\]

deduce that each phase path is a simple closed curve that encircles the equilibrium point at
$(1, 1)$. Hence every solution of the equations is periodic! [This prediction can be confirmed
by solving the original equations numerically.]

Poincaré-Bendixson

8.11 Use Poincaré-Bendixson to show that the system

\[
\dot{x} = x - y - (x^2 + 4y^2)x, \qquad \dot{y} = x + y - (x^2 + 4y^2)y,
\]

has a limit cycle lying in the annulus $\tfrac{1}{2} < r < 1$.


8.12 Van der Pol’s equation Use Poincaré-Bendixson to show that Van der Pol’s equation*

\[
\ddot{x} + \epsilon\dot{x}\left(x^2 - 1\right) + x = 0
\]

has a limit cycle for any positive value of the constant $\epsilon$. [The method is similar to that used
for Rayleigh’s equation in Example 8.4.]

Driven oscillations 

8.13 A driven non-linear oscillator satisfies the equation

\[
\ddot{x} + \epsilon\dot{x}^3 + x = \cos pt,
\]

where $\epsilon$, $p$ are positive constants. Use perturbation theory to find a two-term approximation
to the driven response when $\epsilon$ is small. Are there any restrictions on the value of $p$?

8.14 Super-harmonic resonance A driven non-linear oscillator satisfies the equation

\[
\ddot{x} + 9x + \epsilon x^3 = \cos t,
\]

where $\epsilon$ is a small parameter. Use perturbation theory to investigate the possible existence of
a superharmonic resonance. Show that the zero order solution is

\[
x_0 = \tfrac{1}{8}\left(\cos t + a_0\cos 3t\right),
\]

where $a_0$ is a constant that is not known at the zero order stage.

By proceeding to the first order stage, show that $a_0$ is the unique real root of the cubic
equation

\[
3a_0^3 + 6a_0 + 1 = 0,
\]

which is about $-0.164$. Thus, when driving the oscillator at this sub-harmonic frequency, the
non-linear correction appears in the zero order solution. However, there are no infinities to be
found in the perturbation scheme at this (or any other) stage.

Plot the graph of $x_0(t)$ and the path of the phase point $(x_0(t), \dot{x}_0(t))$.


* After the extravagantly named Dutch physicist Balthasar Van der Pol (1889-1959). The equation arose 
in connection with the current in an electronic circuit. In 1927 Van der Pol observed what is now called 
deterministic chaos, but did not investigate it further. 




Computer assisted problems 

8.15 Lindstedt’s method Use computer assistance to implement Lindstedt’s method for the
equation

\[
\ddot{x} + x + \epsilon x^3 = 0.
\]

Obtain a three-term approximation to the oscillation frequency when the oscillation has unit
amplitude. Find also the corresponding three-term approximation to $x(t)$.

8.16 Van der Pol’s equation A classic non-linear oscillation equation that has a limit cycle
is Van der Pol’s equation

\[
\ddot{x} + \epsilon\dot{x}\left(x^2 - 1\right) + x = 0,
\]

where $\epsilon$ is a positive parameter. Solve the equation numerically with $\epsilon = 2$ (say) and plot the
motion of a few of the phase points in the $(x, v)$-plane. All the phase paths tend to the limit
cycle. One can see the same effect in a different way by plotting the solution function $x(t)$
against $t$.

8.17 Sub-harmonic and chaotic responses Investigate the steady state responses of the
equation

\[
\ddot{x} + k\dot{x} + x^3 = A\cos t
\]

for various choices of the parameters $k$ and $A$ and various initial conditions. First obtain the
responses shown in Figure 8.9 and then go on to try other choices of the parameters. Some
very exotic results can be obtained! For various chaotic responses try $k = 0.1$ and $A = 7$.


Chapter Nine


The energy principle 

and energy conservation 


KEY FEATURES 

The key features of this chapter are the energy principle for a multi-particle system, the
potential energies arising from external and internal forces, and energy conservation.


This is the first of three chapters in which we study the mechanics of multi-particle 
systems. This is an important development which greatly increases the range of problems 
that we can solve. In particular, multi-particle mechanics is needed to solve problems 
involving the rotation of rigid bodies. 

The chapter begins by obtaining the energy principle for a multi-particle system. This 
is the first of the three great principles of multi-particle mechanics* that apply to every 
mechanical system without restriction. We then show that, under appropriate conditions, 
the total energy of the system is conserved. We apply this energy conservation principle 
to a wide variety of systems. When the system has just one degree of freedom, the energy 
conservation equation is sufficient to determine the whole motion. 

9.1 CONFIGURATIONS AND DEGREES OF FREEDOM 

A multi-particle system S may consist of any number of particles $P_1, P_2, \ldots, P_N$,
with masses $m_1, m_2, \ldots, m_N$ respectively.† A possible ‘position’ of the system is
called a configuration. More precisely, if the particles $P_1, P_2, \ldots, P_N$ of a system have
position vectors $\boldsymbol{r}_1, \boldsymbol{r}_2, \ldots, \boldsymbol{r}_N$, then any geometrically possible set of values for the
position vectors $\{\boldsymbol{r}_i\}$ is a configuration of the system.

If the system is unconstrained, then each particle can take up any position in space
(independently of the others) and all choices of the $\{\boldsymbol{r}_i\}$ are possible. This would be the
case, for instance, if the particles of S were moving freely under their mutual gravitation.
On the other hand, when constraints are present, the $\{\boldsymbol{r}_i\}$ are restricted. Suppose
for instance that the particles $P_1$ and $P_2$ are connected by a light rigid rod of length $a$.
This imposes the geometrical restriction $|\boldsymbol{r}_1 - \boldsymbol{r}_2| = a$ so that not all choices of the $\{\boldsymbol{r}_i\}$
are then possible. This difference is reflected in the number of scalar variables needed
to specify the configuration of S. In the unconstrained case, all of the position vectors
$\{\boldsymbol{r}_i\}$ must be specified separately. Since each of these vectors may be specified by three
Cartesian coordinates, it follows that a total of $3N$ scalar variables are needed to specify
the configuration of an unconstrained $N$-particle system. When constraints are present,
this number is reduced, often dramatically so.

* The other two are the linear momentum and angular momentum principles.

† To save space, we will usually express this by saying that S is the system of particles $\{P_i\}$ with masses
$\{m_i\}$, the range of the index number $i$ being understood to be $1 \le i \le N$.

FIGURE 9.1 The multi-particle system S consists of $N$ particles $P_1, P_2, \ldots, P_N$, of
which the typical particle $P_i$ is labelled. The particle $P_i$ has mass $m_i$, position vector $\boldsymbol{r}_i$,
and velocity $\boldsymbol{v}_i$.

FIGURE 9.2 The generalised coordinates $x$ and $\theta$ are sufficient to specify the
configuration of this two-particle system in planar motion.

For example, consider the system shown in Figure 9.2, which consists of two particles
$P_1$ and $P_2$ connected by a light rigid rod of length $a$. The particle $P_1$ is also constrained
to move along a fixed horizontal rail and the whole system moves in the vertical plane
through the rail. The two scalar variables $x$ and $\theta$ shown are sufficient to specify the
configuration of this system. This contrasts with the six scalar variables that would be
needed if the two particles were in unconstrained motion. The variables $x$ and $\theta$ are said
to be a set of generalised coordinates for this system.* Other choices for the generalised
coordinates could be made, but the number of generalised coordinates needed is always
the same.

Definition 9.1 Degrees of freedom The number of generalised coordinates needed to
specify the configuration of a system S is called the number of degrees of freedom of S.
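For systems whose constraints are purely geometrical, the count can often be done by simple bookkeeping: start from the $3N$ Cartesian coordinates of the unconstrained system and subtract one for each independent scalar restriction. The sketch below applies this to the system of Figure 9.2. The function name and the way the constraints are tallied are our own illustrative assumptions, and the rule presumes that the restrictions really are independent of each other.

```python
def degrees_of_freedom(n_particles, n_constraints):
    """3N Cartesian coordinates minus the number of independent scalar constraints."""
    dof = 3 * n_particles - n_constraints
    if dof < 0:
        raise ValueError("more constraints than coordinates")
    return dof


# Figure 9.2: P1 stays on a straight rail (2 restrictions), P2 stays in the
# vertical plane through the rail (1), and the rod fixes |r1 - r2| = a (1).
rail_and_rod = degrees_of_freedom(2, 2 + 1 + 1)

# Two particles in unconstrained motion.
unconstrained_pair = degrees_of_freedom(2, 0)

print(rail_and_rod, unconstrained_pair)
```

The first count agrees with the two generalised coordinates $x$ and $\theta$, and the second with the six scalar variables needed for two free particles.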

Importance of degrees of freedom 

The number of degrees of freedom of a system is important because it is equal to the
number of equations that are needed to determine the motion of the system. For example,
the system shown in Figure 9.2 has two degrees of freedom and so needs two equations to
determine the motion completely.

* Besides being sufficient to specify the configuration of the system, the generalised coordinates are also
required to be independent, that is, there must be no functional relation between them. The coordinates
$x$, $\theta$ in Figure 9.2 are certainly independent variables. If the coordinates were connected by a functional
relation, they would not all be needed and one of them could be discarded.

FIGURE 9.3 The multi-particle system S consists of $N$ particles $P_1, P_2, \ldots, P_N$, of which the
typical particles $P_i$ and $P_j$ are shown explicitly. The force $\boldsymbol{F}_i$ is the external force
acting on $P_i$ and the force $\boldsymbol{G}_{ij}$ is the internal force exerted on $P_i$ by the particle $P_j$.

Example 9.1 Degrees of freedom 

Find the number of degrees of freedom of the following mechanical systems: (i) the 
simple pendulum (moving in a vertical plane), (ii) a door swinging on its hinges, (iii) 
a bar of soap (a particle) sliding on the inside of a hemispherical basin, (iv) a rigid 
rod sliding on a flat table, (v) four rigid rods flexibly jointed to form a quadrilateral 
which can slide on a flat table. 

Solution 

(i) 1 (ii) 1 (iii) 2 (iv) 3 (v) 4. ■ 

9.2 THE ENERGY PRINCIPLE FOR A SYSTEM 

Let S be a system of $N$ particles $\{P_i\}$, as shown in Figure 9.3. We classify the
forces acting on the particles of S as being external or internal. External forces are those
originating from outside S. (In the case of a single particle, these are the only forces
that act.) Uniform gravity is an example of an external force. However, in multi-particle
systems, the particles are also subject to their own mutual interactions, that is, the forces
that they exert upon each other. These mutual interactions are called the internal forces
acting on S. The situation is shown in Figure 9.3. $\boldsymbol{F}_i$ is the external force acting on the
particle $P_i$, while $\boldsymbol{G}_{ij}$ is the internal force exerted on $P_i$ by the particle $P_j$. By the Third
Law, the force $\boldsymbol{G}_{ji}$ that $P_i$ exerts on $P_j$ must be equal and opposite to the force $\boldsymbol{G}_{ij}$, and
both forces must be parallel to the straight line joining $P_i$ and $P_j$. In short, the $\{\boldsymbol{G}_{ij}\}$ must
satisfy

\[
\boldsymbol{G}_{ji} = -\boldsymbol{G}_{ij}, \qquad \text{and} \qquad \boldsymbol{G}_{ij} \parallel (\boldsymbol{r}_i - \boldsymbol{r}_j). \tag{9.1}
\]

To obtain the energy principle for the system S, we proceed in the same way as we did for
a single particle in section 6.1. The equation of motion for the particle $P_i$ is*

\[
m_i\frac{d\boldsymbol{v}_i}{dt} = \boldsymbol{F}_i + \sum_{j=1}^{N}\boldsymbol{G}_{ij}, \tag{9.2}
\]

where $\boldsymbol{v}_i$ is the velocity of $P_i$ at time $t$. On taking the scalar product of both sides of
equation (9.2) with $\boldsymbol{v}_i$ and then summing the result over all the particles ($1 \le i \le N$), we
obtain

\[
\frac{dT}{dt} = \sum_{i=1}^{N}\Bigl(\boldsymbol{F}_i + \sum_{j=1}^{N}\boldsymbol{G}_{ij}\Bigr)\cdot\boldsymbol{v}_i, \tag{9.3}
\]

where

\[
T = \sum_{i=1}^{N}\tfrac{1}{2}m_i|\boldsymbol{v}_i|^2
\]

is the total kinetic energy of the whole system S. Suppose that, in the time interval $[t_A, t_B]$,
the system S moves from configuration $A$ to configuration $B$. On integrating equation
(9.3) with respect to $t$ over the time interval $[t_A, t_B]$ we obtain

\[
T_B - T_A = \sum_{i=1}^{N}\int_{t_A}^{t_B}\boldsymbol{F}_i\cdot\boldsymbol{v}_i\,dt
+ \sum_{i=1}^{N}\sum_{j=1}^{N}\int_{t_A}^{t_B}\boldsymbol{G}_{ij}\cdot\boldsymbol{v}_i\,dt, \tag{9.4}
\]

where $T_A$ and $T_B$ are the kinetic energies of the system S at times $t_A$ and $t_B$ respectively.
This is the energy principle for a multi-particle system moving under the external forces
$\{\boldsymbol{F}_i\}$ and internal forces $\{\boldsymbol{G}_{ij}\}$. This impressive looking result can be stated quite simply
as follows:


Energy principle for a multi-particle system 

In any motion of a system, the increase in the total kinetic energy of the system in 
a given time interval is equal to the total work done by all the external and internal 
forces during this time interval. 


* The summation over $j$ in equation (9.2) contains the term $\boldsymbol{G}_{ii}$ which corresponds to the force that the
particle $P_i$ exerts upon itself. Since such a force is not actually present, we should really say that the
summation is over the range $1 \le j \le N$ with $j \neq i$. Since this would make the formulae look messy, we
adopt the device of regarding the terms $\boldsymbol{G}_{11}, \boldsymbol{G}_{22}, \ldots, \boldsymbol{G}_{NN}$ (which do not actually exist) as being zero.

9.3 ENERGY CONSERVATION FOR A SYSTEM

In order to develop an energy conservation principle, we need to write the right
side of the energy principle (9.4) in the form $V(A) - V(B)$, where $V$ is the potential
energy function for the whole system. We first consider unconstrained systems.

Unconstrained systems

When the system is unconstrained, all the forces that act on the system are specified
directly. We will assume that the external forces $\boldsymbol{F}_i$ are conservative fields. In this case
$\boldsymbol{F}_i = -\operatorname{grad}\phi_i$, where $\phi_i$ is the potential energy function of the field $\boldsymbol{F}_i$. Then the total
work done by the external forces can be written

\[
\sum_{i=1}^{N}\int_{t_A}^{t_B}\boldsymbol{F}_i\cdot\boldsymbol{v}_i\,dt
= \sum_{i=1}^{N}\bigl(\phi_i(\boldsymbol{r}_i^A) - \phi_i(\boldsymbol{r}_i^B)\bigr) = \Phi(A) - \Phi(B),
\]

where

\[
\Phi(\boldsymbol{r}_1, \boldsymbol{r}_2, \ldots, \boldsymbol{r}_N) = \phi_1(\boldsymbol{r}_1) + \phi_2(\boldsymbol{r}_2) + \cdots + \phi_N(\boldsymbol{r}_N)
\]

is the potential energy of S arising from the external forces.

Example 9.2 Potential energy under uniform gravity 

Find the potential energy $\Phi$ when the external forces on S arise from uniform gravity.

Solution

Under uniform gravity, the force $\boldsymbol{F}_i$ exerted on particle $P_i$ is $\boldsymbol{F}_i = -m_i g\boldsymbol{k}$, where the
unit vector $\boldsymbol{k}$ points vertically upwards. This conservative field has potential energy
$\phi_i = m_i g z_i$, where $z_i$ is the $z$-coordinate of $P_i$. The total potential energy of S due to
uniform gravity is therefore

\[
\Phi = m_1 g z_1 + m_2 g z_2 + \cdots + m_N g z_N.
\]

On using the definition of centre of mass given in section 3.5, this can be written in
the alternative form

\[
\Phi = MgZ,
\]

where $M$ is the total mass of S, and $Z$ is the $z$-coordinate of the centre of mass of S.
Thus the potential energy of any system due to uniform gravity is the same as if all its
mass were concentrated at its centre of mass. ■
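The equality of the two forms of $\Phi$ is easily confirmed for particular numbers. The following sketch evaluates the sum $m_1 g z_1 + \cdots + m_N g z_N$ and the product $MgZ$ for three particles; the masses, heights, value of $g$ and function names are arbitrary illustrative choices.

```python
def gravitational_pe(masses, heights, g=9.81):
    """Phi as the sum of m_i * g * z_i over all the particles."""
    return sum(m * g * z for m, z in zip(masses, heights))


def centre_of_mass_height(masses, heights):
    """Z = (sum of m_i * z_i) / M, the z-coordinate of the centre of mass."""
    return sum(m * z for m, z in zip(masses, heights)) / sum(masses)


masses = [1.0, 2.5, 0.5]
heights = [0.2, -1.0, 3.0]

phi_sum = gravitational_pe(masses, heights)
phi_com = sum(masses) * 9.81 * centre_of_mass_height(masses, heights)
print(phi_sum, phi_com)
```

The two numbers agree to rounding error, as the definition of the centre of mass requires.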

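We now need to make a similar transformation to show that the work done by the
internal forces can be written in the form $\Psi(A) - \Psi(B)$, where $\Psi$ is the internal potential
energy. The argument is as follows:

We know from the Third Law that the $\{\boldsymbol{G}_{ij}\}$ satisfy the conditions (9.1), but a little more must be
assumed. We further assume that the magnitude of $\boldsymbol{G}_{ij}$ depends only on $r_{ij}$, the distance between $P_i$ and
$P_j$.* Internal forces that satisfy this condition will be called conservative; mutual gravitation forces are a
typical example. Hence, when the internal forces are conservative, $\boldsymbol{G}_{ij}$ must have the form

\[
\boldsymbol{G}_{ij} = h_{ij}(r_{ij})\,\hat{\boldsymbol{r}}_{ij}, \tag{9.5}
\]

where (see Figure 9.3)

\[
\boldsymbol{r}_{ij} = \boldsymbol{r}_i - \boldsymbol{r}_j, \qquad r_{ij} = |\boldsymbol{r}_{ij}|, \qquad \hat{\boldsymbol{r}}_{ij} = \boldsymbol{r}_{ij}/r_{ij}. \tag{9.6}
\]

Note that $h_{ij}$ is the repulsive force that the particles $P_i$ and $P_j$ exert upon each other.

Consider now the rate of working of the pair of forces $\boldsymbol{G}_{ij}$ and $\boldsymbol{G}_{ji}$. This is

\[
\boldsymbol{G}_{ij}\cdot\boldsymbol{v}_i + \boldsymbol{G}_{ji}\cdot\boldsymbol{v}_j
= \boldsymbol{G}_{ij}\cdot(\boldsymbol{v}_i - \boldsymbol{v}_j)
= h_{ij}(r_{ij})\,\hat{\boldsymbol{r}}_{ij}\cdot\dot{\boldsymbol{r}}_{ij}
= h_{ij}(r_{ij})\,\dot{r}_{ij},
\]

on using equations (9.1), (9.6) and the identity $\boldsymbol{r}_{ij}\cdot\dot{\boldsymbol{r}}_{ij} = r_{ij}\dot{r}_{ij}$. The total work done by the forces $\boldsymbol{G}_{ij}$
and $\boldsymbol{G}_{ji}$ during the time interval $[t_A, t_B]$ is therefore

\[
\int_{t_A}^{t_B} h_{ij}(r_{ij})\,\frac{dr_{ij}}{dt}\,dt
= \int_{r_{ij}(A)}^{r_{ij}(B)} h_{ij}(r_{ij})\,dr_{ij}
= H_{ij}(r_{ij}(A)) - H_{ij}(r_{ij}(B)),
\]

where $H_{ij}$ is the indefinite integral of $-h_{ij}$. The function $H_{ij}(r_{ij})$ is called the mutual potential energy
of the particles $P_i$ and $P_j$.

It follows that the total work done by all the internal forces in the time interval $[t_A, t_B]$ can be written
in the form

\[
\sum_{i=1}^{N}\sum_{j=1}^{N}\int_{t_A}^{t_B}\boldsymbol{G}_{ij}\cdot\boldsymbol{v}_i\,dt = \Psi(A) - \Psi(B),
\]

where

\[
\Psi(\boldsymbol{r}_1, \boldsymbol{r}_2, \ldots, \boldsymbol{r}_N) = \sum_{i=1}^{N}\sum_{j=1}^{i-1} H_{ij}(r_{ij})
\]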

is the potential energy of S arising from the internal forces. This potential energy is just the sum of the 
mutual potential energies of all pairs of particles. 


Example 9.3 Internal energy of three charged particles 

Three particles $P_1$, $P_2$, $P_3$ carry electric charges $e_1$, $e_2$, $e_3$ respectively. Find the
internal potential energy $\Psi$.

Solution

In cgs/electrostatic units, the particles $P_1$ and $P_2$ repel each other with the force
$h_{12}(r_{12}) = e_1 e_2/(r_{12})^2$, where $r_{12}$ is the distance between $P_1$ and $P_2$. Their mutual
potential energy is therefore

\[ H_{12} = -\int h_{12}(r_{12})\,dr_{12} = -\int \frac{e_1 e_2}{(r_{12})^2}\,dr_{12} = \frac{e_1 e_2}{r_{12}}. \]


* This is equivalent to the very reasonable assumptions that the magnitude of $\mathbf{G}_{ij}$ is invariant under spatial
translations and rotations of each pair of particles $P_i$ and $P_j$, and is independent of the time.

The internal potential energy of the whole system is therefore

\[ \Psi = \frac{e_1 e_2}{r_{12}} + \frac{e_1 e_3}{r_{13}} + \frac{e_2 e_3}{r_{23}}. \]
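The pairwise structure of the internal potential energy in Example 9.3 extends directly to any number of charges. The sketch below sums $e_i e_j / r_{ij}$ over all distinct pairs in cgs/electrostatic units; the function name and the sample configuration are assumptions made for illustration.

```python
import math
from itertools import combinations


def internal_potential_energy(charges, positions):
    """Sum of e_i * e_j / r_ij over all distinct pairs (cgs/electrostatic units)."""
    psi = 0.0
    for (e_i, r_i), (e_j, r_j) in combinations(zip(charges, positions), 2):
        psi += e_i * e_j / math.dist(r_i, r_j)
    return psi


# Illustrative configuration (assumed): three charges at the corners
# of a right-angled triangle.
charges = [1.0, 2.0, 3.0]
positions = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
psi = internal_potential_energy(charges, positions)
```

Each pair is counted once, which matches the double sum with $j < i$ in the definition of $\Psi$.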


On combining the above results, the energy principle (9.4) can be written

\[ T_B - T_A = V(A) - V(B), \]

where $V = \Phi + \Psi$ is the total potential energy of the system $S$. This is equivalent to the
energy conservation formula

\[ T + V = E \tag{9.7} \]

where $E$ is the total energy of the system. This result can be summarised as follows:


Energy conservation for an unconstrained system 

When both the external and internal forces acting on a system are conservative, the 
sum of its kinetic and potential energies* remains constant in the motion. 


Example 9.4 A star with two planets 

A star of very large mass $M$ is orbited by two planets $P_1$ and $P_2$ of masses $m_1$ and
$m_2$. Find the energy conservation equation for this system.


Solution 


Since the mass of the star is supposed to be very much larger than the planetary
masses, we will neglect its motion and suppose that it is fixed at the origin $O$. We
then have a two-particle problem in which the planets move under the (external) gravitational
attraction of the star and their (internal) mutual gravitational interaction. This
is an unconstrained system.

The total potential energy arising from external forces is then

\[ \Phi = -\frac{M m_1 G}{r_1} - \frac{M m_2 G}{r_2}, \]

where $r_1$, $r_2$ are the distances $OP_1$, $OP_2$.

The particles $P_1$ and $P_2$ repel each other with the force
$h_{12}(r_{12}) = -m_1 m_2 G/(r_{12})^2$, where $r_{12}$ is the distance between $P_1$ and $P_2$. Their mutual
potential energy is therefore

\[ H_{12} = -\int h_{12}(r_{12})\,dr_{12} = \int \frac{m_1 m_2 G}{(r_{12})^2}\,dr_{12} = -\frac{m_1 m_2 G}{r_{12}}, \]

and this is the only contribution to the internal potential energy $\Psi$.


* The potential energy is the total of the potential energies arising from both the external and internal forces.

Since the system is unconstrained and the external and internal forces are conservative,
energy conservation applies. The energy conservation equation for the
system is

\[ \tfrac12 m_1 |\mathbf{v}_1|^2 + \tfrac12 m_2 |\mathbf{v}_2|^2 - MG\left(\frac{m_1}{r_1} + \frac{m_2}{r_2}\right) - \frac{m_1 m_2 G}{r_{12}} = E, \]


where $\mathbf{v}_1$, $\mathbf{v}_2$ are the velocities of the planets $P_1$, $P_2$, and $E$ is the constant total energy.
The value of E is determined from the initial conditions. 

Since this system has six degrees of freedom (four if the motions are confined 
to a plane through O), the energy conservation equation is by no means sufficient to 
determine the motion! ■ 
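Although energy conservation does not determine the motion, it is a useful check on a numerical solution. The sketch below integrates the planar two-planet problem of Example 9.4 with a leapfrog (velocity-Verlet) scheme and evaluates $T + \Phi + \Psi$ before and after. The units ($G = M = 1$), the planetary masses, the initial data and the step size are all illustrative assumptions.

```python
import math

G, M = 1.0, 1.0            # units chosen so that G = M = 1 (assumption)
MASSES = (1e-3, 2e-3)      # planetary masses m1, m2 (illustrative)


def accelerations(pos):
    """Acceleration of each planet: pull of the fixed star plus the other planet."""
    acc = []
    for i in (0, 1):
        j = 1 - i
        xi, yi = pos[i]
        r = math.hypot(xi, yi)
        dx, dy = xi - pos[j][0], yi - pos[j][1]
        d = math.hypot(dx, dy)
        ax = -G * M * xi / r**3 - G * MASSES[j] * dx / d**3
        ay = -G * M * yi / r**3 - G * MASSES[j] * dy / d**3
        acc.append((ax, ay))
    return acc


def total_energy(pos, vel):
    """T + Phi + Psi for the star with two planets."""
    m1, m2 = MASSES
    kinetic = sum(0.5 * m * (vx * vx + vy * vy) for m, (vx, vy) in zip(MASSES, vel))
    r1, r2 = math.hypot(*pos[0]), math.hypot(*pos[1])
    r12 = math.dist(pos[0], pos[1])
    return kinetic - M * G * (m1 / r1 + m2 / r2) - m1 * m2 * G / r12


def leapfrog(pos, vel, dt, steps):
    """Advance positions and velocities with the velocity-Verlet scheme."""
    acc = accelerations(pos)
    for _ in range(steps):
        vel = [(vx + 0.5 * dt * ax, vy + 0.5 * dt * ay) for (vx, vy), (ax, ay) in zip(vel, acc)]
        pos = [(x + dt * vx, y + dt * vy) for (x, y), (vx, vy) in zip(pos, vel)]
        acc = accelerations(pos)
        vel = [(vx + 0.5 * dt * ax, vy + 0.5 * dt * ay) for (vx, vy), (ax, ay) in zip(vel, acc)]
    return pos, vel


pos0 = [(1.0, 0.0), (-2.0, 0.0)]
vel0 = [(0.0, 1.0), (0.0, -0.7)]
E_start = total_energy(pos0, vel0)
pos1, vel1 = leapfrog(pos0, vel0, dt=1e-3, steps=5000)
E_end = total_energy(pos1, vel1)
```

With these starting values the total energy is negative, and the computed energy stays close to its initial value over the run; a large drift would indicate an error in the force law or the integrator.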


Question Can a planet escape? 

If the initial conditions are such that $E < 0$, is it possible for a planet to escape to
infinity?


Answer

If $E < 0$, then it is certainly not possible for both planets to escape to infinity, since
the total energy would then be positive. However, the escape of one planet is not
prohibited by energy conservation. This does not mean, however, that such an escape
will actually happen.

Constrained systems 

When a system is subject to constraints, not all the forces that act on the system are specified.
This is because constraints are enforced by constraint forces that are not part of the
specification of the problem; all we know is that their effect is to enforce the given constraints.
The work done by constraint forces cannot generally be calculated (or expressed
in terms of a potential energy) and we are restricted to those systems for which the total
work done by the constraint forces happens to be zero.*

The constraint forces acting on the system may be external (for example, when a 
particle of the system is constrained to remain at rest), or internal (for example, when two 
particles of the system are constrained to remain the same distance apart). 

A The list of external constraint forces that do no work is the same as that given in 
Section 6.5 for single particle motion. 

B The most important result regarding internal constraint forces that do no work is 
this: The total work done by any pair of mutual interaction forces is zero when 
the particles on which they act are constrained to remain a fixed distance apart. 
The proof is as follows: 

Suppose two particles $P_i$ and $P_j$ are constrained to remain a fixed distance apart and that their
mutual interaction forces are $\mathbf{G}_{ij}$ and $\mathbf{G}_{ji}$ (see Figure 9.3). Since the distance between $P_i$ and $P_j$
is constant, it follows that $(\mathbf{r}_i - \mathbf{r}_j)\cdot(\mathbf{r}_i - \mathbf{r}_j)$ is constant, which, on differentiating with respect
to $t$, gives

\[ (\mathbf{r}_i - \mathbf{r}_j)\cdot(\mathbf{v}_i - \mathbf{v}_j) = 0. \]

Thus the vector $(\mathbf{v}_i - \mathbf{v}_j)$ must be perpendicular to the straight line joining $P_i$ and $P_j$. Hence,
the rate of working of the two forces $\mathbf{G}_{ij}$ and $\mathbf{G}_{ji}$ is

\[ \mathbf{G}_{ij}\cdot\mathbf{v}_i + \mathbf{G}_{ji}\cdot\mathbf{v}_j = \mathbf{G}_{ij}\cdot(\mathbf{v}_i - \mathbf{v}_j) = 0, \]

since $\mathbf{G}_{ij}$ is known to be parallel to the straight line joining $P_i$ and $P_j$. Thus the internal constraint
forces $\mathbf{G}_{ij}$ and $\mathbf{G}_{ji}$ do no work in total.

* Individual constraint forces may do work.

FIGURE 9.4 The particles Q and R slide along a smooth horizontal rail
while the particle P moves vertically.

It follows, for example, that the two tension forces exerted by a light inextensible 
string do no work in total. It further follows that the internal forces that enforce 
rigidity in a rigid body do no work in total. This important result allows us to 
solve rigid body problems by energy methods. 


Our result for constrained systems can be summarised as follows: 


Energy conservation for a constrained system 

When the specified external and internal forces acting on a system are conservative,
and the constraint forces do no work in total, the sum of the kinetic and potential
energies of the system remains constant in the motion.


Example 9.5 A constrained three-particle system 

Figure 9.4 shows a ball $P$ of mass $2m$ suspended by light inextensible strings of
length $a$ from two sliders $Q$ and $R$, each of mass $m$, which can move on a smooth
horizontal rail. The system moves symmetrically so that $O$, the mid-point of $Q$ and
$R$, remains fixed and $P$ moves on the downward vertical through $O$. Initially the
system is released from rest with the three particles in a straight line and with the
strings taut. Find the energy conservation equation for the system.

Solution 

This is a system with one degree of freedom and we take the angle $\theta$ as the generalised
coordinate. Let $z$ and $x$ be the displacements of the particles $P$ and $Q$ from the fixed
point $O$. Then, in terms of the generalised coordinate $\theta$, $x = a\cos\theta$ and $z = a\sin\theta$.
Differentiating these formulae with respect to $t$ then gives

\[ \dot{x} = -(a\sin\theta)\,\dot\theta, \qquad \dot{z} = (a\cos\theta)\,\dot\theta. \]

Hence the total kinetic energy of the system is given by

\[ T = \tfrac12(2m)\dot{z}^2 + \tfrac12 m\dot{x}^2 + \tfrac12 m\dot{x}^2 = ma^2\dot\theta^2. \]

The only contribution to the potential energy comes from uniform gravity, so 

that 


\[ V = -(2m)gz + 0 + 0 = -2mga\sin\theta, \]


where we have taken the zero level of potential energy to be at the rail. 

We must now show that the constraint forces do no work. The reactions exerted 
by the smooth rail on the particles Q and R are perpendicular to the rail and therefore 
perpendicular to the velocities of Q and R ; these reactions therefore do no work. Also, 
the tension forces exerted by the inextensible strings do no work in total. Hence, the 
constraint forces do no work in total. 

Energy conservation therefore applies in the form 

\[ ma^2\dot\theta^2 - 2mga\sin\theta = E. \]

From the initial conditions $\theta = \dot\theta = 0$ when $t = 0$, it follows that $E = 0$. The energy
conservation equation for the system is therefore

\[ \dot\theta^2 - \frac{2g}{a}\sin\theta = 0. \] ■

Question When do the sliders collide? 

Find the time that elapses before the sliders collide. 

Answer 

Since this system has only one degree of freedom, the motion can be found from
energy conservation alone. From the energy conservation equation, it follows that

\[ \frac{d\theta}{dt} = \pm\left(\frac{2g}{a}\right)^{1/2}(\sin\theta)^{1/2} \]

and, since $\theta$ is an increasing function of $t$, we take the positive sign. This equation is
a first order separable ODE.

Since the sliders collide when $\theta = \pi/2$, the time $\tau$ that elapses is given by

\[ \tau = \left(\frac{a}{2g}\right)^{1/2}\int_0^{\pi/2}\frac{d\theta}{(\sin\theta)^{1/2}}. \]

FIGURE 9.5 A uniform rope is released from rest hanging over the edge of a smooth table (left).
After time t it has displacement x (right).
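The collision-time integral for the sliders in Example 9.5 has an integrable singularity at $\theta = 0$, so it is convenient to evaluate it numerically. The sketch below is one way to do this: it uses the substitution $\theta = u^2$ (our own choice, not taken from the text) to remove the singularity, applies the midpoint rule, and compares the result with the standard closed form in terms of gamma functions. The function name is hypothetical.

```python
import math


def collision_time_coefficient(n=20000):
    """Return c such that tau = c * sqrt(a / g).

    With theta = u**2 the integrand becomes 2u / sqrt(sin(u**2)),
    which tends to 2 as u -> 0, so the midpoint rule behaves well.
    """
    upper = math.sqrt(math.pi / 2.0)
    h = upper / n
    total = 0.0
    for k in range(n):
        u = (k + 0.5) * h
        total += 2.0 * u / math.sqrt(math.sin(u * u))
    integral = total * h
    return integral / math.sqrt(2.0)


c_numeric = collision_time_coefficient()

# Closed form: the integral equals Gamma(1/4) * Gamma(1/2) / (2 * Gamma(3/4)).
c_exact = math.gamma(0.25) * math.gamma(0.5) / (2.0 * math.gamma(0.75)) / math.sqrt(2.0)
```

Both evaluations give $\tau \approx 1.85\,(a/g)^{1/2}$.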

Example 9.6 Rope sliding off a table 

A uniform inextensible rope of mass M and length a is released from rest hanging 
over the edge of a smooth horizontal table, as shown in Figure 9.5. Find the speed of 
the rope when it has the displacement x shown. 


Solution 


A rope is a continuous distribution of mass, unlike the discrete masses that appear in 
our theory. We regard the rope as being represented by a light inextensible string of 
length a with N particles, each of mass M/N, attached to the string at equally spaced 
intervals along its length. When N is very large, we expect this discrete set of masses 
to approximate the behaviour of the rope. 

Since each particle of the rope has the same speed $v$ $(= \dot{x})$, the total kinetic
energy of the rope is simply

\[ T = \tfrac12 M v^2. \]

The only contribution to the potential energy comes from uniform gravity. If we
take the reference state for $V$ to be the initial configuration (Figure 9.5 (left)), then
the potential energy in the displaced configuration (right) is the same as if a length $x$
of the rope lying on the table were cut off and this piece were then suspended from
the hanging end. In the continuous limit (that is, as $N \to \infty$), this piece of rope has
mass $Mx/a$ and its centre of mass is lowered a distance $b + (x/2)$ by this operation.
The potential energy of the rope in the displaced configuration is therefore

\[ V = -\left(\frac{Mx}{a}\right) g\left(b + \frac{x}{2}\right) = -\frac{Mg}{2a}\,x(x + 2b). \]
We must now show that the constraint forces do no work. The reactions exerted
by the smooth table on the particles of the rope are always perpendicular to the velocities
of these particles; these reactions therefore do no work. Also, the tension forces
exerted by each segment of the inextensible string (connecting adjacent particles of
the rope) do no work in total. Hence, the constraint forces do no work in total.

Energy conservation therefore applies in the form

\[ \tfrac12 M v^2 - \frac{Mg}{2a}\,x(x + 2b) = E. \]

The initial condition $v = 0$ when $x = 0$ implies that $E = 0$. The energy equation for
the rope is therefore

\[ v^2 = \frac{g}{a}\,x(x + 2b). \]

This gives the speed of the rope when it has displacement $x$. This formula holds while
there is still some rope left on the table top. ■

Note. In the above solution we have assumed that the rope follows the contour of the 
table edge and then falls vertically. However, it can be shown that this cannot be true 
when the rope is close to leaving the table. What actually happens is that the end of 
the rope overshoots the table edge. This is a tricky point which we will not investigate 
further. 

Question Displacement at time t 

Find the displacement of the rope at time t. 

Answer 

Since this system has only one degree of freedom, the motion can be found from
energy conservation alone. From the energy conservation equation, it follows that

\[ \frac{dx}{dt} = \pm n\,x^{1/2}(x + 2b)^{1/2}, \]

where $n^2 = g/a$. Since $x$ is an increasing function of $t$, we take the positive sign.
This equation is a first order separable ODE.

It follows that

\[ nt = \int\frac{dx}{x^{1/2}(x + 2b)^{1/2}} = 2\sinh^{-1}\left(\frac{x}{2b}\right)^{1/2} + C, \]

on using the substitution $x = 2b\sinh^2 w$. The initial condition $x = 0$ when $t = 0$
implies that $C = 0$ and, after some simplification, we obtain

\[ x = b(\cosh nt - 1) \]

as the displacement of the rope after time $t$. As before, this formula holds while there
is still some rope left on the table top. ■
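As a consistency check on Example 9.6, the displacement $x = b(\cosh nt - 1)$ can be substituted back into the energy equation $v^2 = (g/a)\,x(x + 2b)$. The sketch below does this numerically; the function names and the sample values of $a$ and $b$ are assumptions made for illustration.

```python
import math


def rope_displacement(t, a, b, g=9.81):
    """x(t) = b (cosh(nt) - 1), with n^2 = g / a."""
    n = math.sqrt(g / a)
    return b * (math.cosh(n * t) - 1.0)


def rope_speed(t, a, b, g=9.81):
    """v(t) = dx/dt = b n sinh(nt)."""
    n = math.sqrt(g / a)
    return b * n * math.sinh(n * t)


def energy_residual(t, a, b, g=9.81):
    """v^2 - (g/a) x (x + 2b); zero if x(t) satisfies the energy equation."""
    x = rope_displacement(t, a, b, g)
    v = rope_speed(t, a, b, g)
    return v * v - (g / a) * x * (x + 2.0 * b)


# Illustrative rope: length a = 2 m with b = 0.5 m initially overhanging.
residuals = [energy_residual(t, a=2.0, b=0.5) for t in (0.0, 0.1, 0.3)]
```

The residuals vanish to rounding error, since $x(x + 2b) = b^2(\cosh^2 nt - 1) = b^2\sinh^2 nt$.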

Example 9.7 Stability of a plank on a log 

A uniform thin rigid plank is placed on top of a rough circular log and can roll without 
slipping. Show that the equilibrium position, in which the plank rests symmetrically 
on top of the log, is stable.

FIGURE 9.6 A thin uniform plank is placed symmetrically on top of a fixed rough
circular log. Is the equilibrium position of the plank stable?



Solution 

Suppose that the plank is disturbed from its equilibrium position and is tilted by an
angle $\theta$ as shown in Figure 9.6. The plank is known to roll on the log, which means
that the distance $GC$ from the centre $G$ of the plank to the contact point $C$ must
always be equal to the arc length of the log that has been traversed. If the radius of
the log is $a$, then this arc length is $a\theta$.

We are not yet able to calculate the kinetic energy of the plank in terms of the
coordinate $\theta$. This is done in the next section. However, we do not need it to investigate
stability.

The only contribution to the potential energy of the plank comes from uniform
gravity. This is given by $V = MgZ$, where $Z$ is the vertical displacement of the
centre of mass $G$ of the plank. Elementary trigonometry (see Figure 9.6) shows that
$Z = a\cos\theta + a\theta\sin\theta - a$, so that

\[ V = Mga(\cos\theta + \theta\sin\theta - 1). \]

We must now show that the constraint forces do no work. The rate of working
of the constraint force $\mathbf{R}$ that the log exerts on the plank is $\mathbf{R}\cdot\mathbf{v}_C$, where $\mathbf{v}_C$ is the
velocity of the particle $C$ of the plank that is instantaneously in contact with the log.
But, since the plank rolls on the log, $\mathbf{v}_C = \mathbf{0}$ so that the rate of working of $\mathbf{R}$ is zero.
Also, the internal constraint forces that enforce the rigidity of the plank do no work
in total. Hence, the constraint forces do no work in total.

Energy conservation therefore applies in the form

\[ T + Mga(\cos\theta + \theta\sin\theta - 1) = E. \]

It follows that the equilibrium position (with the plank on top of the log) will be stable
if $V$ has a minimum at $\theta = 0$. Now $V' = Mga\,\theta\cos\theta$ and $V'' = Mga(\cos\theta - \theta\sin\theta)$
so that, when $\theta = 0$, $V' = 0$ and $V'' = Mga > 0$. Hence $V$ has a minimum at $\theta = 0$ and so
the equilibrium position is stable. ■
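The stability test for the plank can also be carried out numerically, which is useful when $V$ is too complicated to differentiate by hand. The sketch below evaluates $V(\theta) = Mga(\cos\theta + \theta\sin\theta - 1)$ and estimates $V''(0)$ by a central difference; the function names, the step size and the default values $M = a = 1$ are illustrative assumptions.

```python
import math


def plank_potential(theta, M=1.0, g=9.81, a=1.0):
    """V(theta) = M g a (cos(theta) + theta sin(theta) - 1)."""
    return M * g * a * (math.cos(theta) + theta * math.sin(theta) - 1.0)


def second_derivative(f, x, h=1e-4):
    """Central-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


curvature_at_top = second_derivative(plank_potential, 0.0)
```

With $M = a = 1$ the estimate is close to $Mga = 9.81$, which is positive, consistent with a minimum of $V$ at $\theta = 0$.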

9.4 KINETIC ENERGY OF A RIGID BODY 

The general theory we have presented applies to any multi-particle system; in
particular, it applies to the rigid array of particles that we call a rigid body. However, in
order to make use of energy conservation in rigid body dynamics, we need to be able to
express the kinetic energy $T$ of the body in terms of the generalised coordinates.

FIGURE 9.7 The rigid body B rotates about the fixed axis CD with angular velocity ω. A
typical particle P_i moves on the circular path shown.

Rigid body with a fixed axis 

Figure 9.7 shows a rigid body $B$ which is rotating about the fixed axis $CD$. (Imagine that
the body is penetrated by a thin light spindle, which is smoothly pivoted in a fixed position.)
A typical particle $P_i$ of the body can move on the circular path shown. This circle
has radius $p_i$, where $p_i$ is the perpendicular distance of $P_i$ from the axis $CD$. Suppose
that, at some instant, the angular velocity of $B$ about the axis $CD$ is $\omega$. Then the speed of
particle $P_i$ at this instant is $|\omega| p_i$, and its kinetic energy is $\tfrac12 m_i(\omega p_i)^2$. The total kinetic
energy of $B$ is therefore

\[ T = \sum_{i=1}^{N}\left(\tfrac12 m_i(\omega p_i)^2\right) = \tfrac12\left(\sum_{i=1}^{N} m_i p_i^2\right)\omega^2. \]

Definition 9.2 Moment of inertia The quantity

\[ I_{CD} = \sum_{i=1}^{N} m_i p_i^2 \tag{9.8} \]

where $p_i$ is the perpendicular distance of the mass $m_i$ from the axis $CD$, is called the
moment of inertia of the body $B$ about the axis $CD$.

The moment of inertia, as defined above, does not depend on the motion of the body 
B. It is a purely geometrical quantity (like centre of mass), which describes how the mass 
in B is distributed relative to the axis CD. The further the mass in B lies from the axis, the 
larger is the moment of inertia of B about that axis. In the theory of rotating rigid bodies, 
the moment of inertia plays a similar role to that played by mass in the translational motion 
of a particle. 

Our result may be summarised as follows:

Kinetic energy of a rigid body with a fixed axis

Suppose the rigid body $B$ is rotating about the fixed axis $CD$ with angular velocity
$\omega$. Then the kinetic energy of $B$ is given by

\[ T = \tfrac12 I_{CD}\,\omega^2, \tag{9.9} \]

where $I_{CD}$ is the moment of inertia of $B$ about the axis $CD$.


Example 9.8 Moment of inertia of a hoop 

Find the moment of inertia of a uniform hoop of mass M and radius a about its axis 
of rotational symmetry. 

Solution 

This is the easiest case to treat since each particle of the hoop has perpendicular 
distance $a$ from the specified axis. The required moment of inertia is therefore

\[ I = \sum_{i=1}^{N} m_i a^2 = \left(\sum_{i=1}^{N} m_i\right) a^2 = Ma^2, \]

where $M$ is the mass of the whole hoop. ■
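Definition (9.8) translates directly into a short routine for a set of point masses. The sketch below computes $\sum m_i p_i^2$ about an arbitrary axis and applies it to a hoop represented by $N$ equal particles on a circle. The function names, and the discretisation of the hoop into point masses, are assumptions made for illustration.

```python
import math


def moment_of_inertia(masses, points, axis_point, axis_dir):
    """I = sum of m_i * p_i**2, where p_i is the perpendicular distance to the axis."""
    norm = math.sqrt(sum(c * c for c in axis_dir))
    n = [c / norm for c in axis_dir]                      # unit vector along the axis
    total = 0.0
    for m, p in zip(masses, points):
        d = [p[k] - axis_point[k] for k in range(3)]      # vector from the axis point
        along = sum(d[k] * n[k] for k in range(3))        # component along the axis
        p_sq = sum(d[k] * d[k] for k in range(3)) - along * along
        total += m * p_sq
    return total


def discretised_hoop(M, a, N):
    """N equal particles of total mass M on a circle of radius a in the xy-plane."""
    masses = [M / N] * N
    points = [(a * math.cos(2.0 * math.pi * k / N),
               a * math.sin(2.0 * math.pi * k / N),
               0.0) for k in range(N)]
    return masses, points


masses, points = discretised_hoop(M=3.0, a=2.0, N=100)
I_symmetry_axis = moment_of_inertia(masses, points, (0.0, 0.0, 0.0), (0.0, 0.0, 1.0))
I_diameter = moment_of_inertia(masses, points, (0.0, 0.0, 0.0), (1.0, 0.0, 0.0))
```

About the axis of symmetry the result is $Ma^2$, as in Example 9.8; about a diameter the same particles give $\tfrac12 Ma^2$, which illustrates how the moment of inertia depends on the chosen axis.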

It is evident that, in order to solve problems that include rotating rigid bodies, we need 
to know their moments of inertia. These can be worked out from the definition (9.8), or 
its counterpart for continuous mass distributions. The Appendix at the end of the book 
contains examples of how to do this and also contains a table of common moments of 
inertia, including those for the uniform rod, hoop, disk and sphere. Most readers will 
find it convenient to remember the moments of inertia in these four cases. 

Example 9.9 Rotational kinetic energy of the Earth 

Estimate the rotational kinetic energy of the Earth, regarded as a rigid uniform sphere 
rotating about a fixed axis through its centre. 

Solution 

From the Appendix, we find that $I$, the moment of inertia of a uniform sphere about
an axis through its centre, is given by $I = 2MR^2/5$, where $M$ is the mass of the sphere
and $R$ its radius. The kinetic energy of the Earth is therefore given by

\[ T = \tfrac12 I\omega^2 = \tfrac15 MR^2\omega^2, \]

where $M$ is the mass of the Earth, $R$ is its radius, and $\omega$ is its angular velocity.

On inserting the values $M = 6.0 \times 10^{24}$ kg, $R = 6400$ km and $\omega = 7.3 \times 10^{-5}$
radians per second, $T = 2.6 \times 10^{29}$ J approximately. ■

FIGURE 9.8 Two blocks of masses m and 2m are connected by a light inextensible string
which passes over a circular pulley of mass 4m and radius a.



Example 9.10 Attwood’s machine 

Two blocks of masses $m$ and $2m$ are connected by a light inextensible string which
passes over a uniform circular pulley of radius $a$ and mass $4m$. Find the upward
acceleration of the mass $m$.

Solution 

The system is shown in Figure 9.8. We suppose that the string does not slip on the 
pulley and that the pulley is smoothly pivoted about its axis of symmetry. 

Let $z$ be the upward displacement of the mass $m$ (from some reference configuration)
and $v$ $(= \dot{z})$ its upward velocity at time $t$. Then, since the string is inextensible,
the mass $2m$ must have the same displacement and velocity, but measured downwards.
The angular velocity $\omega$ of the pulley is determined from the condition that the string
does not slip. In this case, the velocity of the rim of the pulley and the velocity of
the string must be the same at each point where they are in contact, that is, $a\omega = v$.
Hence $\omega = v/a$. Also, from the table in the Appendix, the moment of inertia of a
uniform circular disk of mass $M$ and radius $a$ about its axis of symmetry is $\tfrac12 Ma^2$.
Hence, the total kinetic energy of the system is

\[ T = \tfrac12 mv^2 + \tfrac12(2m)v^2 + \tfrac12\left(\tfrac12(4m)a^2\right)\left(\frac{v}{a}\right)^2 = \tfrac52 mv^2. \]

The gravitational potential energy of the system (relative to the reference configuration) is

\[ V = mgz - (2m)gz = -mgz. \]

We must now dispose of the constraint forces. (i) At the smooth pivot that
supports the pulley, the reactions are perpendicular to the velocities of the particles
on which they act. Hence these reactions do no work. (ii) Since there is no slippage
between the string and the three material bodies of the system, the total work done
by the string on the bodies must be equal and opposite to the total work done by the
bodies on the string.* (iii) The internal forces that keep the pulley rigid do no work in
total. Hence the constraint forces do no work in total.


* Since this string is massless and inextensible, it can have neither kinetic nor potential energy so that the
total work done on the string must actually be zero.

Energy conservation therefore applies in the form

\[ \tfrac52 mv^2 - mgz = E, \]

where $E$ is the total energy. If we now differentiate this equation with respect to $t$
(and cancel by $mv$), we obtain

\[ \frac{dv}{dt} = \tfrac15 g, \]

which is the equation of motion of the system. Thus the upward acceleration of the
mass $m$ is $g/5$. (If the pulley were massless, the result would be $g/3$.) ■
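The energy argument of Example 9.10 carries over unchanged to general block masses and a general pulley. Writing $T = \tfrac12(m_1 + m_2 + I/a^2)v^2$ and $V = (m_1 - m_2)gz$ and differentiating $T + V = E$ gives the formula used in the sketch below. The function name and its parameters are our own labels, not notation from the text.

```python
def attwood_acceleration(m_light, m_heavy, pulley_inertia, pulley_radius, g=9.81):
    """Upward acceleration of the lighter block, obtained from d/dt (T + V) = 0."""
    effective_mass = m_light + m_heavy + pulley_inertia / pulley_radius**2
    return (m_heavy - m_light) * g / effective_mass


# The case treated in the example: blocks m and 2m, uniform disk pulley of mass 4m.
m, a = 1.0, 0.3                           # illustrative values (assumed)
disk_inertia = 0.5 * (4.0 * m) * a**2     # (1/2) M a^2 with M = 4m
acc_with_pulley = attwood_acceleration(m, 2.0 * m, disk_inertia, a)
acc_massless_pulley = attwood_acceleration(m, 2.0 * m, 0.0, a)
```

These reproduce the values $g/5$ and $g/3$ quoted in the example, independently of the particular $m$ and $a$ chosen.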


Rigid body in general motion 

We now go on to find the kinetic energy of a rigid body that has translational as well as 
rotational motion. The method depends on the following theorem. 

Theorem 9.1 Suppose a general system of particles $S$ has total mass $M$ and that its
centre of mass $G$ has velocity $\mathbf{V}$. Then the total kinetic energy of $S$ can be written in the
form

\[ T = \tfrac12 MV^2 + T^G, \tag{9.10} \]

where $V = |\mathbf{V}|$ and $T^G$ is the kinetic energy of $S$ in its motion relative to $G$.

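Proof. By definition,

\[
\begin{aligned}
T^G &= \tfrac12\sum_{i=1}^{N} m_i(\mathbf{v}_i - \mathbf{V})\cdot(\mathbf{v}_i - \mathbf{V}) \\
    &= \tfrac12\sum_{i=1}^{N} m_i\,\mathbf{v}_i\cdot\mathbf{v}_i - \tfrac12\left(\sum_{i=1}^{N} m_i\mathbf{v}_i\right)\cdot\mathbf{V} - \tfrac12\mathbf{V}\cdot\left(\sum_{i=1}^{N} m_i\mathbf{v}_i\right) + \tfrac12\left(\sum_{i=1}^{N} m_i\right)\mathbf{V}\cdot\mathbf{V} \\
    &= T - \tfrac12(M\mathbf{V})\cdot\mathbf{V} - \tfrac12\mathbf{V}\cdot(M\mathbf{V}) + \tfrac12 M(\mathbf{V}\cdot\mathbf{V}) \\
    &= T - \tfrac12 MV^2,
\end{aligned}
\]

as required. ■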
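Theorem 9.1 holds for any system of particles, rigid or not, and this is easy to confirm numerically. The sketch below generates an arbitrary set of masses and velocities and compares $T$ with $\tfrac12 MV^2 + T^G$; the helper names and the random test data are assumptions made for illustration.

```python
import random


def kinetic_energy(masses, velocities):
    """Sum of (1/2) m_i |v_i|^2."""
    return sum(0.5 * m * sum(c * c for c in v) for m, v in zip(masses, velocities))


def centre_of_mass_velocity(masses, velocities):
    """V = (sum of m_i v_i) / M."""
    M = sum(masses)
    return tuple(sum(m * v[k] for m, v in zip(masses, velocities)) / M for k in range(3))


random.seed(0)
masses = [random.uniform(0.5, 2.0) for _ in range(6)]
velocities = [tuple(random.uniform(-1.0, 1.0) for _ in range(3)) for _ in range(6)]

V = centre_of_mass_velocity(masses, velocities)
relative = [tuple(v[k] - V[k] for k in range(3)) for v in velocities]

T = kinetic_energy(masses, velocities)            # total kinetic energy
T_G = kinetic_energy(masses, relative)            # kinetic energy relative to G
translational = 0.5 * sum(masses) * sum(c * c for c in V)
```

The difference between `T` and `translational + T_G` is at the level of rounding error, because the cross terms vanish when velocities are measured relative to $G$.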

FIGURE 9.9 A rigid body B in general motion. The centre of mass G has velocity V
and B is also rotating with angular velocity ω about an axis through G.

The term $\tfrac12 MV^2$ can be regarded as the translational contribution to $T$. When the
system $S$ is a rigid body, $T^G$ also has a nice physical interpretation. In this case, the
motion of $S$ relative to $G$ is an angular velocity $\omega$ about an axis $CD$ passing through $G$,
as shown in Figure 9.9. It then follows from equation (9.9) that $T^G = \tfrac12 I_{CD}\,\omega^2$. This can
be regarded as the rotational contribution to $T$. We therefore have the result:
Kinetic energy of a rigid body in general motion 

Let $B$ be a rigid body of mass $M$ and let $G$ be its centre of mass. Suppose that
$G$ has velocity $\mathbf{V}$ and that the body is also rotating with angular velocity $\omega$ about an
axis $CD$ passing through $G$. Then the kinetic energy of $B$ is given by

\[ T = \tfrac12 MV^2 + \tfrac12 I_{CD}\,\omega^2, \tag{9.11} \]

where $V = |\mathbf{V}|$ and $I_{CD}$ is the moment of inertia of $B$ about the axis $CD$. The
term $\tfrac12 MV^2$ is called the translational kinetic energy and the term $\tfrac12 I_{CD}\,\omega^2$ the
rotational kinetic energy of $B$.


Example 9.11 Kinetic energy of a rolling wheel 


Find the kinetic energy of the rolling wheel shown in Figure 2.8. 

Solution 

Assume the wheel to be uniform with mass $M$ and radius $b$. Then its centre of mass $C$
has speed $u$ so that the translational kinetic energy is $\tfrac12 Mu^2$. Because of the rolling
condition, the angular velocity of the wheel is given by $\omega = u/b$ so that the rotational
kinetic energy is $\tfrac12 I(u/b)^2$, where $I = \tfrac12 Mb^2$. The total kinetic energy of the wheel
is therefore given by

\[ T = \tfrac12 Mu^2 + \tfrac12\left(\tfrac12 Mb^2\right)\left(\frac{u}{b}\right)^2 = \tfrac34 Mu^2. \] ■

Example 9.12 Cylinder rolling down a plane 

A uniform hollow circular cylinder is rolling down a rough plane inclined at an angle
$\alpha$ to the horizontal. Find the acceleration of the cylinder.

Solution 

FIGURE 9.10 A hollow circular cylinder rolls down a plane inclined at angle α to the
horizontal.

Suppose that, at time $t$, the cylinder has displacement $x$ down the plane (from some
reference configuration) and that the centre of mass $G$ of the cylinder has velocity
$v$ $(= \dot{x})$ down the plane. The angular velocity $\omega$ of the cylinder is then determined by
the rolling condition to be $\omega = v/b$. The kinetic energy of the cylinder is therefore

\[ T = \tfrac12 Mv^2 + \tfrac12 I\omega^2 = \tfrac12 Mv^2 + \tfrac12 I\left(\frac{v}{b}\right)^2, \]

where $M$ is the mass of the cylinder, and $I$ is its moment of inertia about its axis of
symmetry. From the Appendix, we find that $I = Mb^2$ so that the kinetic energy of
the cylinder is given by $T = Mv^2$.

The gravitational potential energy of the cylinder is given by $V = -Mgx\sin\alpha$.
We must now dispose of the constraint forces. The reaction forces that the 
inclined plane exerts on the cylinder act on particles of the cylinder which, because of 
the rolling condition, have zero velocity. These reaction forces therefore do no work. 
Also the internal forces that keep the cylinder rigid do no work in total. Hence the 
constraint forces do no work in total. 

Conservation of energy therefore applies in the form 

\[ Mv^2 - Mgx\sin\alpha = E, \]

where $E$ is the total energy. If we now differentiate this equation with respect to $t$
(and cancel by $Mv$), we obtain

\[ \frac{dv}{dt} = \tfrac12 g\sin\alpha, \]

which is the equation of motion of the cylinder. Thus the acceleration of the cylinder
down the plane is $\tfrac12 g\sin\alpha$. (A block sliding down a smooth plane would have
acceleration $g\sin\alpha$.) ■
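The same energy argument applies to any uniform body of circular cross-section rolling down the plane. With $T = \tfrac12 Mv^2(1 + I/Mb^2)$ and $V = -Mgx\sin\alpha$, differentiating $T + V = E$ gives $dv/dt = g\sin\alpha/(1 + I/Mb^2)$. The sketch below evaluates this; the function name and the dimensionless parameter `inertia_factor` (standing for $I/Mb^2$) are our own labels, and the values used for the solid cylinder and sphere are the standard ones.

```python
import math


def rolling_acceleration(alpha, inertia_factor, g=9.81):
    """Acceleration down the plane for a body rolling without slipping.

    inertia_factor is I / (M b^2): 1 for a hollow cylinder,
    1/2 for a solid cylinder, 2/5 for a solid sphere.
    """
    return g * math.sin(alpha) / (1.0 + inertia_factor)


alpha = math.radians(30.0)                          # illustrative slope angle
acc_hollow = rolling_acceleration(alpha, 1.0)       # the case of Example 9.12
acc_solid_cylinder = rolling_acceleration(alpha, 0.5)
acc_sphere = rolling_acceleration(alpha, 0.4)
acc_sliding = rolling_acceleration(alpha, 0.0)      # smooth plane, no rotation
```

The hollow cylinder gives $\tfrac12 g\sin\alpha$ as in the example; the more the mass is concentrated near the axis, the closer the acceleration comes to the sliding value $g\sin\alpha$.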

Example 9.13 The sliding ladder 

A uniform ladder of length $2a$ is supported by a smooth horizontal floor and leans
against a smooth vertical wall.* The ladder is released from rest in a position making
an angle of 60° with the downward vertical. Find the energy conservation equation
for the ladder.

* Don’t try this at home!

FIGURE 9.11 A uniform ladder of mass M and length 2a is supported by a smooth
horizontal floor and leans against a smooth vertical wall. At time t, its centre of mass G
has (x, z)-coordinates (X, Z) and the ladder makes an angle θ with the downward
vertical.


Solution 

Let $\theta$ be the angle that the ladder makes with the downward vertical after time $t$. The
$(x, z)$-coordinates of the centre of mass $G$ are then given by

\[ X = a\sin\theta, \qquad Z = a\cos\theta, \]

and the corresponding velocity components by

\[ \dot{X} = (a\cos\theta)\,\dot\theta, \qquad \dot{Z} = -(a\sin\theta)\,\dot\theta. \]

The angular velocity of the ladder at time $t$ is simply $\dot\theta$ (see Figure 9.11). The kinetic
energy of the ladder is therefore given by

\[ T = \tfrac12 M\left(\dot{X}^2 + \dot{Z}^2\right) + \tfrac12 I\dot\theta^2 = \tfrac12 Ma^2\dot\theta^2 + \tfrac12 I\dot\theta^2, \]

where $M$ is the mass of the ladder and $I$ is its moment of inertia about the horizontal
axis through $G$. From the Appendix, we find that $I = Ma^2/3$ so that the kinetic
energy of the ladder is given by $T = (2Ma^2/3)\,\dot\theta^2$.

The gravitational potential energy of the ladder is given by $V = MgZ =
Mga\cos\theta$.

We must now dispose of the constraint forces. The reaction forces that the smooth
floor and wall exert on the ladder are both perpendicular to the velocities of the particles
of the ladder on which they act. These reaction forces therefore do no work. Also, the internal
forces that keep the ladder rigid do no work in total. Hence the constraint forces do
no work in total.

Conservation of energy therefore applies in the form 

\[ \tfrac23 Ma^2\dot\theta^2 + Mga\cos\theta = E, \]

where $E$ is the total energy. From the initial conditions $\dot\theta = 0$ and $\theta = \pi/3$ when
$t = 0$, it follows that $E = \tfrac12 Mga$. The energy conservation equation for the ladder
is therefore

\[ \dot\theta^2 = \frac{3g}{4a}(1 - 2\cos\theta). \]

Since the system has only one degree of freedom, this equation is sufficient to determine
the motion.

A curious feature of this problem (not proved here) is that the ladder does not
maintain contact with the wall all the way down, but leaves the wall when $\theta$ becomes
equal to $\cos^{-1}(1/3) \approx 71°$. ■

FIGURE 9.12 Two particles P and Q are connected by a light inextensible string and
can move, with the string taut, on the surface of a smooth horizontal cylinder.
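For the sliding ladder of Example 9.13, the quoted angle at which the ladder leaves the wall can be made plausible with a short calculation. The sketch below rests on an assumption that is not taken from the text: the wall reaction is the only horizontal force, so while contact holds it equals $M\ddot{X}$ with $X = a\sin\theta$. Differentiating the energy equation gives $\ddot\theta = (3g/4a)\sin\theta$, and the code evaluates $\ddot{X}$ from these two relations. The function names are hypothetical.

```python
import math


def theta_dot_squared(theta, a, g=9.81):
    """Energy equation for release from rest at theta = pi/3."""
    return (3.0 * g / (4.0 * a)) * (1.0 - 2.0 * math.cos(theta))


def wall_reaction_per_unit_mass(theta, a, g=9.81):
    """Horizontal acceleration of G; equals (wall reaction) / M while contact holds.

    Uses X = a sin(theta), so
    X'' = a cos(theta) * theta'' - a sin(theta) * theta'^2,
    with theta'' = (3g / 4a) sin(theta) from differentiating the energy equation.
    """
    theta_ddot = (3.0 * g / (4.0 * a)) * math.sin(theta)
    return (a * math.cos(theta) * theta_ddot
            - a * math.sin(theta) * theta_dot_squared(theta, a, g))


theta_leave = math.acos(1.0 / 3.0)                        # about 70.5 degrees
reaction_at_start = wall_reaction_per_unit_mass(math.pi / 3.0, a=1.0)
reaction_at_leave = wall_reaction_per_unit_mass(theta_leave, a=1.0)
```

Under this assumption the reaction is positive at release and falls to zero when $\cos\theta = 1/3$; beyond that angle the wall would have to pull on the ladder, which a smooth wall cannot do.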


Problems on Chapter 9 


Answers and comments are at the end of the book. 

Harder problems carry a star (*). 

Potential energy and stability 

9.1 Figure 9.12 shows two particles $P$ and $Q$, of masses $M$ and $m$, that can move on the
smooth outer surface of a fixed horizontal cylinder. The particles are connected by a light
inextensible string of length $\pi a/2$. Find the equilibrium configuration and show that it is
unstable.

9.2 A uniform rod of length $2a$ has one end smoothly pivoted at a fixed point $O$. The other
end is connected to a fixed point $A$, which is a distance $2a$ vertically above $O$, by a light
elastic spring of natural length a and modulus g. The rod moves in a vertical plane through 
O. Show that there are two equilibrium positions for the rod, and determine their stability. 
[The vertically upwards position for the rod would compress the spring to zero length and is 
excluded.] 

9.3 The internal potential energy function for a diatomic molecule is approximated by the
Morse potential

\[ V(r) = V_0\left(1 - e^{-(r-a)/b}\right)^2 - V_0, \]

where $r$ is the distance of separation of the two atoms, and $V_0$, $a$, $b$ are positive constants.
Make a sketch of the Morse potential.

Suppose the molecule is restricted to vibrational motion in which the centre of mass $G$ of
the molecule is fixed, and the atoms move on a fixed straight line through $G$. Show that there
is a single equilibrium configuration for the molecule and that it is stable. If the atoms each
have mass $m$, find the angular frequency of small vibrational oscillations of the molecule.

FIGURE 9.13 Two blocks of masses M and m slide on smooth planes inclined at angles α
and β to the horizontal. The blocks are connected by a light inextensible string that
passes over a light frictionless pulley.

9.4* The internal gravitational potential energy of a system of masses is sometimes called 
the self energy of the system. (The reference configuration is taken to be one in which the 
particles are all a great distance from each other.) Show that the self energy of a uniform 
sphere of mass $M$ and radius $R$ is $-3M^2G/(5R)$. [Imagine that the sphere is built up by the
addition of successive thin layers of matter brought in from infinity.] 

Particles only 

9.5 Figure 9.13 shows two blocks of masses M and m that slide on smooth planes inclined 
at angles $\alpha$ and $\beta$ to the horizontal. The blocks are connected by a light inextensible string
that passes over a light frictionless pulley. Find the acceleration of the block of mass m up the 
plane, and deduce the tension in the string. 

9.6 Consider the system shown in Figure 9.12 for the special case in which the particles 
P, Q have masses 2 m, m respectively. The system is released from rest in a symmetrical 
position with $\theta$, the angle between $OP$ and the upward vertical, equal to $\pi/4$. Find the energy
conservation equation for the subsequent motion in terms of the coordinate $\theta$.

* Find the normal reactions of the cylinder on each of the particles. Show that $P$ is first to
leave the cylinder and that this happens when $\theta = 70°$ approximately.

Ropes 

9.7 A heavy uniform rope of length $2a$ is draped symmetrically over a thin smooth horizontal
peg. The rope is then disturbed slightly and begins to slide off the peg. Find the speed of the 
rope when it finally leaves the peg. 

9.8 A uniform heavy rope of length $a$ is held at rest with its two ends close together and the
rope hanging symmetrically below. (In this position, the rope has two long vertical segments 
connected by a small curved segment at the bottom.) One of the ends is then released. Find 
the velocity of the free end when it has descended by a distance x. 

Deduce a similar formula for the acceleration of the free end and show that it always 
exceeds g. Find how far the free end has fallen when its acceleration has risen to 5 g. 
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FIGURE 9.14 The circular hoop rolls down the slope 
from one level to another. 



FIGURE 9.15 The roll of paper moves to the right and 
the free paper is gathered on to the roll. 


9.9 A heavy uniform rope of mass M and length 4a has one end connected to a fixed point
on a smooth horizontal table by a light elastic spring of natural length a and modulus $\tfrac{1}{2}Mg$,
while the other end hangs down over the edge of the table. When the spring has its natural 
length, the free end of the rope hangs a distance a vertically below the level of the table top. 
The system is released from rest in this position. Show that the free end of the rope executes 
simple harmonic motion, and find its period and amplitude. 

Rigid bodies 

9.10 A circular hoop is rolling with speed v along level ground when it encounters a slope 
leading to more level ground, as shown in Figure 9.14. If the hoop loses altitude h in the 
process, find its final speed. 

9.11 A uniform ball is rolling in a straight line down a rough plane inclined at an angle a to 
the horizontal. Assuming the ball to be in planar motion, find the energy conservation equation 
for the ball. Deduce the acceleration of the ball. 

9.12 A uniform circular cylinder (a yo-yo) has a light inextensible string wrapped around 
it so that it does not slip. The free end of the string is secured to a fixed point and the yo-yo 
descends in a vertical straight line with the straight part of the string also vertical. Explain why 
the string does no work on the yo-yo. Find the energy conservation equation for the yo-yo and 
deduce its acceleration. 
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9.13 Figure 9.15 shows a partially unrolled roll of paper on a horizontal floor. Initially the 
paper on the roll has radius a and the free paper is laid out in a straight line on the floor. The 
roll is then projected horizontally with speed V in such a way that the free paper is gathered 
up on to the roll. Find the speed of the roll when its radius has increased to b. [Neglect the 
bending stiffness of the paper.] Deduce that the radius of the roll when it comes to rest is



9.14 A rigid body of general shape has mass M and can rotate freely about a fixed horizontal
axis. The centre of mass of the body is distance h from the rotation axis, and the moment of
inertia of the body about the rotation axis is I. Show that the period of small oscillations of
the body about the downward equilibrium position is

$$2\pi\left(\frac{I}{Mgh}\right)^{1/2}.$$


Deduce the period of small oscillations of a uniform rod of length 2a, pivoted about a horizon¬ 
tal axis perpendicular to the rod and distance b from its centre. 

9.15 A uniform ball of radius a can roll without slipping on the outside surface of a fixed 
sphere of (outer) radius b and centre O. Initially the ball is at rest at the highest point of the 
sphere when it is slightly disturbed. Find the speed of the centre G of the ball in terms of the 
variable $\theta$, the angle between the line OG and the upward vertical. [Assume planar motion.]

9.16 A uniform ball of radius a and centre G can roll without slipping on the inside surface 
of a fixed hollow sphere of (inner) radius b and centre O. The ball undergoes planar motion in 
a vertical plane through O. Find the energy conservation equation for the ball in terms of the
variable $\theta$, the angle between the line OG and the downward vertical. Deduce the period of
small oscillations of the ball about the equilibrium position. 

9.17* Figure 9.6 shows a uniform thin rigid plank of length 2b which can roll without slip¬ 
ping on top of a rough circular log of radius a. The plank is initially in equilibrium, resting 
symmetrically on top of the log, when it is slightly disturbed. Find the period of small oscilla¬ 
tions of the plank. 
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Chapter Ten 


The linear momentum principle 


and linear momentum conservation 


KEY FEATURES 

The key features of this chapter are the linear momentum principle; its equivalent form, 
the centre of mass equation; and conservation of linear momentum. These principles are 
applied to rocket propulsion, collision theory, the two-body problem and two-body scat¬ 
tering. 

This chapter is essentially based on the linear momentum principle and its con¬ 
sequences. The linear momentum principle is the second of the three great principles 
of multi-particle mechanics* that apply to every mechanical system without restriction. 
Under appropriate conditions, the linear momentum of a system (or one of its compo¬ 
nents) is conserved. Important applications include rocket propulsion, collision theory, 
the two-body problem and two-body scattering. 


10.1 LINEAR MOMENTUM 


We begin with the definition of linear momentum for a single particle and for a 
system of particles. 

Definition 10.1 Linear momentum If a particle has mass m and velocity $\mathbf{v}$, then $\mathbf{p}$, its
linear momentum, is defined to be

$$\mathbf{p} = m\mathbf{v}. \tag{10.1}$$


For a multi-particle system S consisting of particles $P_1, P_2, \ldots, P_N$, with masses $m_1, m_2,
\ldots, m_N$ and velocities $\mathbf{v}_1, \mathbf{v}_2, \ldots, \mathbf{v}_N$ (see Figure 9.1), $\mathbf{P}$, the linear momentum of S, is
defined to be the vector sum of the linear momenta of the individual particles, that is,

$$\mathbf{P} = \sum_{i=1}^{N} \mathbf{p}_i = \sum_{i=1}^{N} m_i\mathbf{v}_i. \tag{10.2}$$


* The other two are the energy and angular momentum principles.

Newton’s Second Law can be written in terms of linear momentum in the form

$$\frac{d\mathbf{p}}{dt} = \mathbf{F}.$$

Although this offers no advantage in the mechanics of a single particle, we will find that 
this type of formulation is very useful in multi-particle mechanics. The expression (10.2) 
can be written simply in terms of the motion of G, the centre of mass of S. Since the 
position vector R of G is given by 


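$$\mathbf{R} = \frac{\sum_{i=1}^{N} m_i\mathbf{r}_i}{\sum_{i=1}^{N} m_i},$$

where $\mathbf{r}_i$ is the position vector of the particle $P_i$, it follows that $\mathbf{V}$, the velocity of G, is
given by

$$\mathbf{V} = \frac{\sum_{i=1}^{N} m_i\mathbf{v}_i}{\sum_{i=1}^{N} m_i} = \frac{\mathbf{P}}{M},$$

where M ($= \sum m_i$) is the total mass of the system S. Hence

$$\mathbf{P} = M\mathbf{V}. \tag{10.3}$$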

Thus the linear momentum of any system is the same as if all its mass were concentrated 
at its centre of mass. 

Although true for all systems, this result is most useful when finding the linear 
momentum of a moving rigid body. Note that the rotational motion of the rigid body 
does not contribute to its linear momentum; this contrasts with the corresponding calcula¬ 
tion of the kinetic energy of a rigid body (see Chapter 9). 
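
As a quick numerical check of the result P = MV, the following minimal Python sketch sums the individual momenta of a few particles and compares the total with M times the centre of mass velocity. The masses and velocities are invented sample values, and the function names are hypothetical helpers introduced only for this illustration.

```python
# Illustrative check of P = M V. All numbers below are invented sample data.

def linear_momentum(masses, velocities):
    """Vector sum of m_i * v_i over all particles (the definition of P)."""
    dim = len(velocities[0])
    return [sum(m * v[k] for m, v in zip(masses, velocities)) for k in range(dim)]


def centre_of_mass_velocity(masses, velocities):
    """V = (sum of m_i v_i) / (sum of m_i)."""
    total_mass = sum(masses)
    return [p / total_mass for p in linear_momentum(masses, velocities)]


masses = [1.0, 2.0, 3.0]
velocities = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [-1.0, 0.0, 4.0]]

P = linear_momentum(masses, velocities)
V = centre_of_mass_velocity(masses, velocities)
M = sum(masses)
MV = [M * component for component in V]

print("P  =", P)
print("MV =", MV)
```

The two printed vectors should agree to rounding error, whatever sample data is used.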


10.2 THE LINEAR MOMENTUM PRINCIPLE 

We now derive the fundamental result which relates the linear momentum of any 
system to the external forces that act upon it: the linear momentum principle. 

Suppose that the system S is acted upon by the external forces $\{\mathbf{F}_i\}$ and internal
forces $\{\mathbf{G}_{ij}\}$, as shown in Figure 9.3. Then the equation of motion for the particle $P_i$ is

$$m_i\frac{d\mathbf{v}_i}{dt} = \mathbf{F}_i + \sum_{j=1}^{N}\mathbf{G}_{ij}, \tag{10.4}$$

where, as in Chapter 9, we take $\mathbf{G}_{ij} = \mathbf{0}$ when $i = j$. Then the rate of increase of the
linear momentum of the system S can be written

$$\frac{d\mathbf{P}}{dt} = \frac{d}{dt}\left(\sum_{i=1}^{N} m_i\mathbf{v}_i\right) = \sum_{i=1}^{N} m_i\frac{d\mathbf{v}_i}{dt},$$

which, on using the equation of motion (10.4), gives

$$\frac{d\mathbf{P}}{dt} = \sum_{i=1}^{N}\left(\mathbf{F}_i + \sum_{j=1}^{N}\mathbf{G}_{ij}\right) = \sum_{i=1}^{N}\mathbf{F}_i + \sum_{i=1}^{N}\sum_{j=1}^{N}\mathbf{G}_{ij} = \sum_{i=1}^{N}\mathbf{F}_i + \sum_{i=1}^{N}\left(\sum_{j=1}^{i-1}\left(\mathbf{G}_{ij} + \mathbf{G}_{ji}\right)\right), \tag{10.5}$$


where the terms of the double sum have been grouped in pairs and those terms known
to be zero have been omitted. Now the internal forces $\{\mathbf{G}_{ij}\}$ satisfy the Third Law, so
that $\mathbf{G}_{ji} = -\mathbf{G}_{ij}$. Hence, each term of the double sum in equation (10.5) is zero and we
obtain


Linear momentum principle

$$\frac{d\mathbf{P}}{dt} = \mathbf{F} \tag{10.6}$$


where F is the total external force acting on S. This is the linear momentum principle. 
This fundamental principle can be expressed as follows: 


Linear momentum principle 

In any motion of a system, the rate of increase of its linear momentum is equal to 
the total external force acting upon it. 


It should be noted that only the external forces appear in the linear momentum prin¬ 
ciple so that the internal forces need not be known. It is this fact which gives the linear 
momentum principle its power. 
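
The cancellation of the internal forces can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it invents random external forces and random internal forces for four particles, imposes the Third Law on the internal forces, and confirms that the sum of all forces on all particles equals the sum of the external forces alone. The function name and all data are hypothetical.

```python
import random

# Illustration only: random forces for N = 4 particles in three dimensions.

def total_force_on_all_particles(external, internal):
    """Sum over i of (F_i + sum over j of G_ij), i.e. the value of dP/dt."""
    n = len(external)
    dim = len(external[0])
    total = [0.0] * dim
    for i in range(n):
        for k in range(dim):
            total[k] += external[i][k]
            for j in range(n):
                total[k] += internal[i][j][k]
    return total


random.seed(0)
n, dim = 4, 3
external = [[random.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(n)]

# Internal forces obeying the Third Law: G_ji = -G_ij, and G_ii = 0.
internal = [[[0.0] * dim for _ in range(n)] for _ in range(n)]
for i in range(n):
    for j in range(i):
        g = [random.uniform(-5.0, 5.0) for _ in range(dim)]
        internal[i][j] = g
        internal[j][i] = [-c for c in g]

dP_dt = total_force_on_all_particles(external, internal)
F_total = [sum(external[i][k] for i in range(n)) for k in range(dim)]

print("dP/dt            =", dP_dt)
print("total external F =", F_total)
```

Changing the size of the internal forces leaves the result unchanged, which is the point of the principle: the internal forces never need to be known.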

10.3 MOTION OF THE CENTRE OF MASS 

The linear momentum principle can be written in an alternative form called the 
centre of mass equation, which is more useful for some purposes. If we substitute the 
expression (10.3) for P into the linear momentum principle (10.6) we obtain


Centre of mass equation

$$M\frac{d\mathbf{V}}{dt} = \mathbf{F} \tag{10.7}$$
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which is called the centre of mass equation. It has the form of an equation of motion 
for a fictitious particle of mass M situated at the centre of mass, which moves under the 
total of the external forces acting on the system S. This important result can be simply
expressed as follows: 


Motion of the centre of mass 

The centre of mass of any system moves as if it were a particle of mass the total 
mass, and all the external forces acted upon it. 


Example 10.1 Jumping cat 

A cat leaps off a table and lands on the floor. Show that, while the cat is in the air, its 
centre of mass moves on a parabolic path. 

Solution 

While the cat is in the air, the total external force on its body is due to uniform gravity, 
that is, $\mathbf{F} = -Mg\mathbf{k}$. The centre of mass equation for the cat is therefore

$$M\frac{d\mathbf{V}}{dt} = -Mg\mathbf{k},$$

which is precisely the equation of projectile motion for a single particle. The path 
of the centre of mass of the cat is therefore the same as if it were a particle of mass 
M moving freely under uniform gravity. This path is known (see Chapter 4) to be a 
parabola. ■ 

In previous examples, we have often used the Second Law to find an unknown con¬ 
straint force acting on a particle, once the motion of a system has been found by other 
means (see, for instance, Example 6.13). The centre of mass equation allows us to do 
the same thing when the unknown constraint force acts on a rigid body. The following 
examples illustrate the method. 

Example 10.2 Cylinder rolling down an inclined plane 

Consider again a hollow cylinder of mass M rolling down a rough inclined plane as 
shown in Figure 9.10. In Example 9.12, energy conservation was used to show that 
the acceleration of the cylinder down the plane is $\tfrac{1}{2}g\sin\alpha$. Deduce the reaction force
exerted by the plane on the cylinder. 

Solution 

Suppose that the component of the reaction force normal to the plane is N, while
the component of the reaction force up the plane is F. (The plane is rough so both
components are present.) The cylinder is therefore subject to these ‘two’ external
forces together with uniform gravity. The centre of mass equation for the cylinder
(when resolved into components tangential and normal to the plane) is given by

$$M\frac{dv}{dt} = Mg\sin\alpha - F, \qquad 0 = N - Mg\cos\alpha,$$

where $dv/dt = \tfrac{1}{2}g\sin\alpha$. It follows that the required reactions are given by

$$F = \tfrac{1}{2}Mg\sin\alpha, \qquad N = Mg\cos\alpha.$$

Thus, if F and N are restricted by the ‘law of friction’ $F/N \le \mu$, then the supposed
rolling motion of the cylinder cannot take place if $\tan\alpha > 2\mu$. ■
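
The calculation in Example 10.2 can be sketched in a few lines of Python. This is a minimal illustration, assuming the acceleration quoted from Example 9.12 for the hollow cylinder; the function names, the value of g and the sample friction coefficient are assumptions made here, not part of the example.

```python
import math

# Sketch of Example 10.2: reactions on a hollow cylinder rolling down a plane.

def hollow_cylinder_reactions(M, alpha, g=9.81):
    """Return (F, N): friction up the plane and normal reaction.

    Uses the centre of mass equation with the rolling acceleration
    dv/dt = (1/2) g sin(alpha) quoted in the example.
    """
    acceleration = 0.5 * g * math.sin(alpha)
    F = M * g * math.sin(alpha) - M * acceleration   # tangential component
    N = M * g * math.cos(alpha)                      # normal component
    return F, N


def rolling_is_possible(alpha, mu):
    """True if the required friction satisfies F/N <= mu."""
    F, N = hollow_cylinder_reactions(1.0, alpha)
    return F / N <= mu


mu = 0.3                              # hypothetical coefficient of friction
critical_angle = math.atan(2.0 * mu)  # rolling fails when tan(alpha) > 2 mu
print("critical angle (degrees):", math.degrees(critical_angle))
print(rolling_is_possible(critical_angle - 0.01, mu))
print(rolling_is_possible(critical_angle + 0.01, mu))
```

The mass cancels from the ratio F/N, which is why the rolling condition depends only on the slope and the friction coefficient.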

Example 10.3 Sliding ladder 

Consider again the uniform ladder of length 2a supported by a smooth horizontal
floor and leaning against a smooth vertical wall, as shown in Figure 9.11. The ladder
is released from rest with $\theta$, the angle between the ladder and the downward vertical,
equal to 60°. In Example 9.13, we used energy conservation to show that, in the
subsequent motion, $\theta$ satisfies the differential equation

$$\dot{\theta}^2 = \frac{3g}{4a}\left(1 - 2\cos\theta\right),$$

provided that the ladder maintains contact with the wall. Deduce that the ladder loses
contact with the wall when $\theta = \cos^{-1}(1/3)$.

Solution 

Let the normal reactions exerted on the ladder by the smooth floor and wall be $N_F$
and $N_W$ respectively. Then the centre of mass equation for the ladder, resolved into
horizontal and vertical components, is given by

$$M\ddot{X} = N_W, \qquad M\ddot{Z} = N_F - Mg,$$

where (X, Z) are the coordinates of the centre of mass of the ladder (see Figure 9.11).
Hence

$$N_F = M\ddot{Z} + Mg, \qquad N_W = M\ddot{X}.$$

Now, in terms of the angle $\theta$, $X = a\sin\theta$ and $Z = a\cos\theta$. On differentiating twice
with respect to t, we obtain the corresponding acceleration components

$$\ddot{X} = -a(\sin\theta)\,\dot{\theta}^2 + a(\cos\theta)\,\ddot{\theta},$$

$$\ddot{Z} = -a(\cos\theta)\,\dot{\theta}^2 - a(\sin\theta)\,\ddot{\theta}.$$

Hence

$$N_F = -Ma\left((\cos\theta)\,\dot{\theta}^2 + (\sin\theta)\,\ddot{\theta}\right) + Mg,$$

$$N_W = Ma\left(-(\sin\theta)\,\dot{\theta}^2 + (\cos\theta)\,\ddot{\theta}\right).$$

In order to express these reactions in terms of $\theta$ alone, we need to know $\dot{\theta}^2$ and $\ddot{\theta}$ as
functions of $\theta$. From the previously derived equation of motion, we already have

$$\dot{\theta}^2 = \frac{3g}{4a}\left(1 - 2\cos\theta\right)$$

and, if we differentiate this equation with respect to t (and cancel by $\dot{\theta}$), we obtain

$$\ddot{\theta} = \frac{3g}{4a}\sin\theta.$$

On making use of the above expressions for $\dot{\theta}^2$ and $\ddot{\theta}$, the required reactions are
found to be

$$N_F = \frac{Mg}{4}\left(1 - 3\cos\theta + 9\cos^2\theta\right), \qquad N_W = \frac{3Mg}{4}\sin\theta\left(3\cos\theta - 1\right).$$

We observe that the predicted value of $N_W$ becomes zero when $\theta = \cos^{-1}(1/3)$
and is negative thereafter. Since negative values of $N_W$ cannot occur (the wall can
only push), we conclude that the condition that the ladder maintains contact with the
wall is violated when $\theta > \cos^{-1}(1/3)$. Therefore, the ladder leaves the wall when
$\theta = \cos^{-1}(1/3)$. ■
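
The reactions in Example 10.3 can be checked numerically. The following Python sketch, offered as an illustration rather than as part of the solution, evaluates the reactions from the acceleration components of the centre of mass and locates the angle at which the wall reaction vanishes by bisection. The function names and the sample values M = 1, a = 1 and g = 9.81 are assumptions.

```python
import math

# Sketch of Example 10.3: reactions on the sliding ladder as functions of theta.

def ladder_reactions(theta, M=1.0, g=9.81, a=1.0):
    """Return (N_F, N_W) while the ladder is still in contact with the wall."""
    theta_dot_sq = (3.0 * g / (4.0 * a)) * (1.0 - 2.0 * math.cos(theta))
    theta_ddot = (3.0 * g / (4.0 * a)) * math.sin(theta)
    # Acceleration components of the centre of mass, X = a sin(theta), Z = a cos(theta).
    X_ddot = -a * math.sin(theta) * theta_dot_sq + a * math.cos(theta) * theta_ddot
    Z_ddot = -a * math.cos(theta) * theta_dot_sq - a * math.sin(theta) * theta_ddot
    N_W = M * X_ddot
    N_F = M * Z_ddot + M * g
    return N_F, N_W


def wall_departure_angle(lo=math.pi / 3.0, hi=math.pi / 2.0, tol=1e-12):
    """Bisection for the angle between 60 and 90 degrees at which N_W = 0."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if ladder_reactions(mid)[1] > 0.0:
            lo = mid      # wall still pushes: departure happens later
        else:
            hi = mid
    return 0.5 * (lo + hi)


theta_leave = wall_departure_angle()
print("numerical departure angle:", theta_leave)
print("acos(1/3)                :", math.acos(1.0 / 3.0))
```

The two printed angles should agree closely, and both correspond to about 70.5° from the downward vertical.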

10.4 CONSERVATION OF LINEAR MOMENTUM 

Suppose that S is an isolated system, meaning that no external force acts on any 
of its particles. Then F, the total external force acting on S , is obviously zero. The linear 
momentum principle (10.6) for S then takes the form dP /dt = 0, which implies that P 
must remain constant. This simple but important result can be stated as follows: 


Conservation of linear momentum 

In any motion of an isolated system, the total linear momentum is conserved. 


It follows from equation (10.3) that the above result can also be stated in the alternative
form ‘In any motion of an isolated system, the centre of mass of the system moves
with constant velocity’. Clearly the same result applies to any system for which the total
external force is zero, whether isolated or not.

It is also possible for a particular component of P to be conserved while other com¬ 
ponents are not. Let n be a constant unit vector and suppose that F • n = 0 at all times. 
Then 


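$$\frac{d}{dt}\left(\mathbf{P}\cdot\mathbf{n}\right) = \frac{d\mathbf{P}}{dt}\cdot\mathbf{n} + \mathbf{P}\cdot\frac{d\mathbf{n}}{dt} = \frac{d\mathbf{P}}{dt}\cdot\mathbf{n} = \mathbf{F}\cdot\mathbf{n} = 0.$$


Hence the component $\mathbf{P}\cdot\mathbf{n}$ is conserved. This result can be stated as follows:

[Figure 10.1 panels: before ejection; after ejection.]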

FIGURE 10.1 A rigid body of mass M (the rocket) contains a removable rigid block 
of mass m (the fuel). An internal source of energy causes the fuel block to be ejected 
backwards with speed u relative to the rocket and the rocket is projected forwards. 


Conservation of a component of linear momentum 

If the total force acting on a system has zero component in a fixed direction, then, 
in any motion of the system, the component of the total linear momentum in that 
direction is conserved. 


Conservation of linear momentum is an important property of a system and the sec¬ 
tions that follow rely heavily upon it. Two examples of momentum conservation are as 
follows: 

• The solar system is an example of an isolated system, being extremely remote 
from any other masses. It follows that the total linear momentum of the solar 
system is conserved. Thus the centre of mass of the solar system moves with 
constant velocity. 

• On the other hand, a grasshopper trying to move on a perfectly smooth horizontal 
table is not isolated, being subject to gravity and the vertical reaction of the table. 
However, since the grasshopper is not subject to any external horizontal force, it 
follows that, whatever the grasshopper tries to do, his component of total linear 
momentum in any horizontal direction is conserved. His vertical component of 
linear momentum is not conserved; he can leap into the air if he wishes. 


10.5 ROCKET MOTION 

An important application of linear momentum conservation is rocket propulsion. 
Figure 10.1 shows a rigid body of mass M (the rocket) which contains a removable rigid 
block of mass m (the fuel). The system is at rest when an internal source of energy causes 
the fuel block to be ejected backwards with speed u relative to the rocket. If the system is 
isolated, then its total linear momentum is conserved, which implies that 

$$Mv + m(v - u) = 0$$

[Figure 10.2 panels: initially; after time $\tau$.]

FIGURE 10.2 The rocket and its fuel at times t = 0 and t = $\tau$. The element of fuel
ejected in the time interval [t, t + dt] has mass $(-\dot{m})\,dt$ and (forward) velocity
$v(t) - u$.


where v is the forward velocity of the rocket after the ejection of the fuel. As a result of 
this process the rocket acquires the forward velocity 

$$v = \left(\frac{m}{M + m}\right)u.$$

This is the basic principle of rocket propulsion. The only mechanically significant dif¬ 
ference between the simple example above and real rocket propulsion is that, in the case 
of the real rocket, the fuel mass is ejected continuously over a period of time and not in 
a single lump. In practice, the fuel is burned continuously and the combustion products 
eject themselves due to their rapid expansion. 

Rocket motion in free space 

Figure 10.2 shows a more realistic situation. Initially the rocket and its fuel have combined
mass $m_0$ and are moving with constant velocity $v_0$. At time t = 0 the motors are started
and fuel products are ejected backwards with speed u relative to the rocket. The fuel
‘burn’ continues for a time T, at the end of which the rocket and unburned fuel have mass
$m_1$. Let m = m(t) be the mass of the rocket and its unburned fuel after time t. Then m is a
decreasing function of t and the rate of ejection of mass at time t is $-\dot{m}$. Let the system S
consist of the rocket together with its fuel at time t = 0. After some time $\tau$ into the burn,
the mass of S is distributed as shown in Figure 10.2. The rocket and unburned fuel have
mass $m(\tau)$ and the remaining mass has been ejected as expended fuel. We will suppose
that, once an element of fuel is ejected, it continues to move with the velocity it had at
the instant of ejection.* Since we are assuming S to be an isolated system, its total linear
momentum is conserved. The initial linear momentum in this one-dimensional problem
is $m_0v_0$ and the final linear momentum of the rocket and unburned fuel is $m(\tau)v(\tau)$,
where v (= v(t)) is the velocity of the rocket at time t. It remains to take account of
the linear momentum of the ejected fuel. Consider the element of fuel that was ejected
in the time interval [t, t + dt]. This has mass $(-\dot{m}(t))\,dt$ and its forward velocity at the
instant of ejection was $v(t) - u$. The linear momentum of this fuel element is therefore
$(-\dot{m})(v - u)\,dt$ and the total linear momentum of the fuel expended in the time interval
$[0, \tau]$ is

$$-\int_0^{\tau}\dot{m}(v - u)\,dt.$$

* This assumption simplifies our derivation but, as we will see, it is not essential.

Linear momentum conservation for the system S therefore requires that

$$m_0v_0 = m(\tau)v(\tau) - \int_0^{\tau}\dot{m}(v - u)\,dt,$$

which can be written in the form

$$\int_0^{\tau}\left[\frac{d}{dt}(mv) - \dot{m}(v - u)\right]dt = 0.$$

Since this equality must hold for any choice of $\tau$ during the burn, it follows that the
integrand must be zero, that is

$$\frac{d}{dt}(mv) - \dot{m}(v - u) = 0$$

for 0 < t < T. This simplifies to give


Rocket equation in free space

$$m\frac{dv}{dt} = (-\dot{m})u \tag{10.8}$$


the rocket equation, which holds for 0 < t < T. The rocket equation can be interpreted
physically as the Second Law applied to a system of variable mass* m(t), namely the
rocket and its unburned fuel. In this interpretation, the term on the right, $(-\dot{m})u$, plays the
role of force and is called the thrust supplied by the motors.

Note. In our derivation, we assumed that, once an element of fuel is ejected, it continues to move with
the velocity it had at the instant of ejection. This is equivalent to assuming that each element of ejected


* This terminology is undesirable since, in classical mechanics, a ‘system’ means a fixed set of masses (or, 
at the very least, fixed total mass). No standard mechanical principle applies to a ‘system’ whose total 
mass is changing with time. 
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fuel is isolated from other fuel and from the rocket. It clearly makes no difference to the momentum of 
the ejected fuel if momentum is exchanged between elements of itself so that this assumption is actually 
unnecessary. However we must retain the assumption that ejected fuel has no further interaction with the 
rocket. This seems likely to be true in free space, but whether it is true just after take off from solid ground 
is questionable. 


Providing that the ejection speed u is constant, the rocket equation (10.8) can easily be
solved for any mass ejection rate. On dividing through by m and integrating with respect
to t, we obtain

$$v = \int\frac{(-\dot{m})u}{m}\,dt = -u\int\frac{dm}{m} = -u\ln m + \text{constant}$$

and, on applying the initial condition $v = v_0$ when t = 0, we obtain

$$v(t) = v_0 + u\ln\left(\frac{m_0}{m(t)}\right).$$

This gives the rocket velocity at time t. In particular, at the end of the fuel burn, the rocket
velocity has increased by

$$\Delta v = v_1 - v_0 = u\ln\left(\frac{m_0}{m_1}\right), \tag{10.9}$$

where $m_1$ and $v_1$ are the final mass and velocity of the rocket. One can make some
interesting deductions from this solution.

(i) $\Delta v$, the increase in the rocket velocity, is directly proportional to u, the fuel ejection
speed. Thus it pays to make u as large as possible. Chemical processes can produce
values of u as high as 5000 m s$^{-1}$.

(ii) If the fuel were all ejected in a single lump, $\Delta v$ would never exceed the ejection
speed u. But when the fuel is ejected over a period of time, it is possible for
the rocket to attain any velocity by making the mass ratio $m_0/m_1$ large enough.
For example, if we wish to make $\Delta v = 3u$, then we need $m_0/m_1 = e^3 \approx 20$.
This means that 19 kg of fuel would be required for every kilogram of payload.
The amount of fuel needed to achieve higher velocities quickly makes the process
impractical. To achieve $\Delta v = 10u$ takes 22 metric tons of fuel for every kilogram
of payload!
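
The two deductions above can be explored numerically. The sketch below is a rough illustration under stated assumptions: it ejects the fuel in n equal lumps, applying the single-lump result to each ejection in turn, and compares the total velocity gain with the continuous-burn formula (10.9). It also evaluates the fuel needed per kilogram of final mass, taking the final mass to be the payload as the text does. The function names and sample numbers are hypothetical.

```python
import math

# Sketch: lump-by-lump ejection versus the continuous rocket formula.

def delta_v_lumps(m0, m1, u, n):
    """Velocity gain when the fuel mass (m0 - m1) is ejected in n equal lumps.

    Each ejection uses the single-lump result: a body of total mass 'mass'
    that throws off a lump dm at relative speed u gains speed dm * u / mass.
    """
    dm = (m0 - m1) / n
    mass = m0
    gain = 0.0
    for _ in range(n):
        gain += dm * u / mass
        mass -= dm
    return gain


def delta_v_continuous(m0, m1, u):
    """Formula (10.9): velocity gain for continuous ejection."""
    return u * math.log(m0 / m1)


def fuel_per_kg_of_final_mass(dv_over_u):
    """Fuel mass needed per unit final mass to achieve a given dv/u."""
    return math.exp(dv_over_u) - 1.0


u = 1.0
for n in (1, 10, 1000, 100000):
    print(n, delta_v_lumps(20.0, 1.0, u, n))
print("continuous:", delta_v_continuous(20.0, 1.0, u))
print("fuel per kg for dv = 3u :", fuel_per_kg_of_final_mass(3.0))
print("fuel per kg for dv = 10u:", fuel_per_kg_of_final_mass(10.0))
```

With a single lump the gain stays below u, while the many-lump totals approach u ln(m0/m1), which is consistent with deduction (ii).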



Rocket motion under gravity 

Suppose now that the rocket is moving vertically under gravity. If we regard the govern¬ 
ing equation as the equation of motion for the variable mass m(t), then, when gravity is 
introduced, the equation of motion becomes 


Rocket equation including gravity

$$m\frac{dv}{dt} = (-\dot{m})u - mg \tag{10.10}$$
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where v is measured vertically upwards, and the weight force mg means m(t)g. In this 
case, the effective force on the right is the sum of the thrust $(-\dot{m})u$ acting upwards and
the weight force mg acting downwards.

When the gravity is uniform and the ejection speed u is constant, the new rocket
equation (10.10) can also be solved easily for any mass ejection rate. On dividing through
by m and integrating with respect to t, we obtain

$$v = \int\left[\frac{(-\dot{m})u}{m} - g\right]dt = -u\ln m - gt + \text{constant}$$

and, on applying the initial condition $v = v_0$ when t = 0, we obtain

$$v(t) = v_0 + u\ln\left(\frac{m_0}{m(t)}\right) - gt.$$

This gives the rocket velocity at time t. In particular, at the end of the fuel burn, the rocket
velocity has increased by

$$\Delta v = v_1 - v_0 = u\ln\left(\frac{m_0}{m_1}\right) - gT, \tag{10.11}$$

where $m_1$ and $v_1$ are the final mass and velocity of the rocket and T is the time taken to
burn all the fuel.

It will be noticed that, if T is too large, then Av will be negative, which is hardly 
possible for a rocket standing on the ground. The reason for this paradox is that, if the fuel 
is burned too slowly then the thrust will be less than the initial weight of the rocket, which 
will not take off until its weight has become less than the thrust. We will therefore assume 
that $(-\dot{m})u > mg$ at all times during the burn so that the rocket has positive upward
acceleration and achieves its maximum speed when t = T. If the rocket starts from rest,
it then follows that the maximum speed achieved is

$$v_{\max} = u\ln\left(\frac{m_0}{m_1}\right) - gT.$$


In this and the zero gravity case, the distance travelled during the burn depends on the 
functional form of m(t). 
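
To see how the distance travelled depends on m(t), one has to choose a particular burn law. The Python sketch below assumes a constant mass ejection rate (an assumption made here purely for illustration), integrates the rocket equation with gravity numerically from rest, and compares the final speed with $u\ln(m_0/m_1) - gT$. The function names and the sample rocket data are hypothetical.

```python
import math

# Sketch: vertical rocket with a constant burn rate, starting from rest.

def lifts_off_immediately(m0, m1, u, T, g=9.81):
    """True if the (constant) thrust exceeds the initial weight."""
    burn_rate = (m0 - m1) / T          # this is (-dm/dt), assumed constant
    return burn_rate * u > m0 * g


def simulate_burn(m0, m1, u, T, g=9.81, steps=20000):
    """Integrate dv/dt = (-dm/dt) u / m - g; return (final speed, height gained)."""
    burn_rate = (m0 - m1) / T
    dt = T / steps
    v = 0.0
    h = 0.0
    for i in range(steps):
        t_mid = (i + 0.5) * dt
        accel = burn_rate * u / (m0 - burn_rate * t_mid) - g   # midpoint rule
        v_new = v + accel * dt
        h += 0.5 * (v + v_new) * dt                            # trapezium rule
        v = v_new
    return v, h


m0, m1, u, T = 1000.0, 400.0, 2500.0, 60.0   # invented sample values (SI units)
print("lifts off at t = 0:", lifts_off_immediately(m0, m1, u, T))
v_end, height = simulate_burn(m0, m1, u, T)
print("numerical final speed:", v_end)
print("u ln(m0/m1) - g T    :", u * math.log(m0 / m1) - 9.81 * T)
print("height gained during burn:", height)
```

The final speed depends only on the mass ratio and the burn time, whereas the height printed last would change if a different burn law m(t) were substituted.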


10.6 COLLISION THEORY 

Another important application of linear momentum conservation occurs when we 
have an isolated system of two particles, and one particle is in collision with the other. 

Collision processes 

It is important to understand the meaning of the term ‘collision’. Suppose that the mutual
interaction between the two particles tends to zero as the distance between them tends to
infinity, so that, if the particles are initially a great distance apart, each must be moving
with constant velocity. If the particles approach each other, then there follows a period
during which their mutual interaction causes their straight line motions to be disturbed. If
the particles finally retreat to a great distance from each other, then they will move with
constant velocities again, and these final velocities will generally be different to the initial
velocities. This is what we mean by a collision process. Note that a collision process
is not restricted to those cases in which the particles make physical contact with each
other. This can of course happen, as in the ‘real’ collision of two pool balls. However, the
deflection suffered by an alpha particle in passing close to a nucleus is also a ‘collision’,
even though the alpha particle and the nucleus never made contact. Collision processes are
particularly important in nuclear and particle physics, where they are the major source
of experimental information.

FIGURE 10.3 A collision between two particles viewed from the laboratory frame.
A particle of mass $m_1$ and initial velocity $\mathbf{u}$ collides with a ‘target’ particle of mass
$m_2$, which is initially at rest. After the collision, the particles have velocities $\mathbf{u}_1$ and
$\mathbf{u}_2$ respectively. $\theta_1$ is the scattering angle of the mass $m_1$, $\theta_2$ is the recoil angle of
the mass $m_2$, and $\theta$ ($= \theta_1 + \theta_2$) is the opening angle between the emerging paths.

General collisions 

Consider the collision shown in Figure 10.3. A particle of mass $m_1$ and initial velocity $\mathbf{u}$
is incident upon a ‘target’ particle of mass $m_2$ which is initially at rest.* This is typical
of the collisions observed in nuclear physics. After the collision we will suppose the
particles retain their identities (and therefore their masses) and emerge with velocities $\mathbf{u}_1$
and $\mathbf{u}_2$ respectively. How are the final and initial motions of the particles related? Clearly
we cannot ‘solve the problem’ since we have not even said what the mutual interaction 
between the particles is. However, it is surprising how much can be deduced simply from 
conservation laws without any detailed knowledge of the interaction. 

Since the two particles form an isolated system, their total linear momentum is conserved,
that is,

$$m_1\mathbf{u} = m_1\mathbf{u}_1 + m_2\mathbf{u}_2. \tag{10.12}$$

This linear relation between the vectors $\mathbf{u}$, $\mathbf{u}_1$ and $\mathbf{u}_2$ implies that these three velocities
must lie in the same plane so that scattering processes are two-dimensional.

* This means ‘at rest in the laboratory reference frame’.

Generally, collisions are not energy preserving. The energy principle for the collision
has the form

$$\tfrac{1}{2}m_1u^2 + Q = \tfrac{1}{2}m_1u_1^2 + \tfrac{1}{2}m_2u_2^2,$$

where $u = |\mathbf{u}|$, $u_1 = |\mathbf{u}_1|$, $u_2 = |\mathbf{u}_2|$, and Q is the energy gained in the collision. In
‘real’ collisions between large bodies, energy is usually lost in the form of heat, so that Q
is negative. However, in nuclear collisions in which the particles change their identities, it
is perfectly possible for energy to be gained.

Example 10.4 Making Kraptons 

A little known particle physicist has proposed the existence of a new particle, with
charge +2 and mass 2, which he has named the Krapton. He has calculated that this
can be produced by the collision of two protons in the reaction*

$$p^+ + p^+ + 10\ \text{MeV} \longrightarrow K^{++}$$

Having failed to obtain funding to verify his theory, he has built his own equipment
with which he accelerates protons to an energy of 16 MeV and uses them to bombard
a stationary target of hydrogen. Could he succeed in making a Krapton?

Solution

Suppose a proton with kinetic energy E collides with a proton at rest. Then this system
has initial linear momentum $(2mE)^{1/2}$, where m is the mass of a proton. This
linear momentum is preserved by the collision so that, if a Krapton of mass 2m were
produced, it would have linear momentum $(2mE)^{1/2}$ and therefore kinetic energy
E/2. Hence, only 8 MeV of the initial energy is available for Krapton building and,
according to the physicist’s own calculation, this is not enough. (On the other hand, a
head-on collision between two 5 MeV protons would be enough. Why?) ■
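
The energy bookkeeping in Example 10.4 can be sketched as a small calculation. The code below is an illustrative, non-relativistic sketch for two particles moving along one line: it subtracts from the total kinetic energy the kinetic energy that the merged particle must carry to conserve linear momentum. The function name, the choice of units (proton masses and MeV) and the `opposite` flag are assumptions made for this illustration.

```python
import math

# Sketch: kinetic energy available for the reaction in a collinear collision.

def energy_available(m1, E1, m2, E2, opposite=True):
    """Kinetic energy left over after momentum conservation is satisfied.

    The two particles move along one line with kinetic energies E1 and E2.
    If 'opposite' is True they approach head-on, otherwise they move the
    same way. A merged particle of mass m1 + m2 must carry momentum P and
    hence kinetic energy P**2 / (2 (m1 + m2)), which is not available.
    """
    p1 = math.sqrt(2.0 * m1 * E1)
    p2 = math.sqrt(2.0 * m2 * E2)
    P = p1 - p2 if opposite else p1 + p2
    return E1 + E2 - P * P / (2.0 * (m1 + m2))


# Masses in units of the proton mass, energies in MeV.
print("16 MeV proton on a proton at rest:", energy_available(1.0, 16.0, 1.0, 0.0))
print("two 5 MeV protons, head-on       :", energy_available(1.0, 5.0, 1.0, 5.0))
```

In the head-on case the total linear momentum is zero, so the merged particle can be produced at rest and none of the kinetic energy is tied up in its motion.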

Elastic collisions 

The linear momentum equation (10.12) holds whether the collision is between pool balls, 
protons or peaches. Much more can be said if the collision is also energy preserving. 

Definition 10.2 Elastic collision A collision between particles is said to be elastic if 
the total kinetic energy of the particles is conserved in the collision. 


* The electron volt (eV) is a unit of energy equal to $1.6 \times 10^{-19}$ J approximately.




Frame invariance In order that the above definition be physically meaningful, it is necessary that a collision
observed to be elastic in one inertial frame should also be elastic when observed from any other. This
is not obviously true, since kinetic energy is not a linear quantity. However, since the total kinetic energy
of the system can be written in the form $T = T^{CM} + T^{G}$ (see Theorem 9.10), where $T^{CM}$ is preserved in
the collision and $T^{G}$ is frame independent, it follows that any gain or loss of kinetic energy in the collision
is independent of the inertial reference frame used to observe the event.

Elastic collisions are very common and extremely important. For example, any collision
in which the mutual interaction force is conservative is elastic. In particular, the
collisions that occur in Rutherford scattering are elastic. In elastic collisions, we have
energy conservation in the form

$$\tfrac{1}{2}m_1u^2 = \tfrac{1}{2}m_1u_1^2 + \tfrac{1}{2}m_2u_2^2 \tag{10.13}$$


and, together with linear momentum conservation (10.12), we can make some interesting
deductions. If we take the scalar product of each side of the linear momentum equation
(10.12) with itself, we obtain

$$m_1^2u^2 = m_1^2u_1^2 + 2m_1m_2\,\mathbf{u}_1\cdot\mathbf{u}_2 + m_2^2u_2^2,$$

and, if we now eliminate the term in $u^2$ between this equation and the energy conservation
equation (10.13), we obtain, after simplification,

$$2m_1\,\mathbf{u}_1\cdot\mathbf{u}_2 = (m_1 - m_2)\,u_2^2. \tag{10.14}$$


Since 11 1 • 1/2 = u \ U 2 cos 6, where 0 is the opening angle between the paths of the emerg¬ 
ing particles, the formula (10.14) can also be written 


cos 9 = 


(mi ~ mi) h 2 
2nt 1 u 1 


(10.15) 


provided that u 1 / 0, that is, provided that the incident particle is not brought to rest by 
the collision.* This formula holds tor all elastic collisions, whatever the nature of the 
particles and the interaction. It therefore applies equally well to pool balls^ and protons, 
but not peaches. Given the mass ratio of the two particles, formula (10.15) relates the 
speeds of the particles and the opening angle between their paths after the collision. 


* The incident particle can be brought to rest in a head-on collision with a particle of equal mass. 

† Collisions between pool balls are very nearly elastic. However, in the present treatment, we are
disregarding the rotation of the balls.

Example 10.5 Finding the final energies

A ball of mass m and (kinetic) energy E is in an elastic collision with a second ball 
of mass 4m that is initially at rest. The two balls depart in directions making an angle 
of 120° with each other. What are the final energies of the two balls? 

Solution 

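On substituting the given data into the formula (10.15), we find that $u_1/u_2 = 3$. It
follows that

$$\frac{E_1}{E_2} = \frac{\tfrac{1}{2} m u_1^2}{\tfrac{1}{2} (4m) u_2^2} = \frac{1}{4}\left(\frac{u_1}{u_2}\right)^2 = \frac{9}{4}.$$

Hence, since $E_1 + E_2 = E$ by energy conservation, $E_1 = \tfrac{9}{13}E$ and $E_2 = \tfrac{4}{13}E$. ■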
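The arithmetic of Example 10.5 can be checked numerically. The following is a minimal sketch, not part of the original treatment: the function names `speed_ratio` and `energy_fractions` are hypothetical, and the only physics used is formula (10.15) together with energy conservation $E_1 + E_2 = E$.

```python
import math

def speed_ratio(m1, m2, opening_angle_deg):
    """Return u1/u2 from (10.15): cos(theta) = (m1 - m2) u2 / (2 m1 u1)."""
    cos_theta = math.cos(math.radians(opening_angle_deg))
    if abs(cos_theta) < 1e-12 or m1 == m2:
        # A right angle (the equal-mass case) leaves the ratio undetermined.
        raise ValueError("speed ratio is not determined by (10.15)")
    return (m1 - m2) / (2.0 * m1 * cos_theta)

def energy_fractions(m1, m2, opening_angle_deg):
    """Return (E1/E, E2/E) for an elastic collision with the target at rest."""
    ratio = speed_ratio(m1, m2, opening_angle_deg)   # u1/u2
    e1_over_e2 = (m1 / m2) * ratio ** 2              # (m1 u1^2) / (m2 u2^2)
    e2 = 1.0 / (1.0 + e1_over_e2)                    # uses E1 + E2 = E
    return 1.0 - e2, e2

# Data of Example 10.5: masses m and 4m, opening angle 120 degrees.
e1, e2 = energy_fractions(1.0, 4.0, 120.0)
print(e1, e2)
```

Only the mass ratio matters here, so the masses are entered as 1 and 4 in arbitrary units.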

An important special case occurs when the two particles have equal masses. In this 
case, formula (10.15) shows that the opening angle must always be a right angle. Thus, in 
an elastic collision between particles of equal mass, the particles depart in directions at
right angles. Note that this result applies only when the target particle is initially at rest. 

Example 10.6 Elastic collision between two electrons 

In an elastic collision between an electron with kinetic energy E and an electron at 
rest, the incoming electron is observed to be deflected through an angle of 30°. What 
are the energies of the two electrons after the collision?

Solution 

Since the collision is elastic and the electrons have equal mass, the opening angle 
between the emerging paths must be 90°. The target electron must therefore recoil at 
an angle of 60° to the initial direction of the incoming electron. Let the speed of the
incoming electron be $u$ and the speeds of the electrons after the collision be $u_1$ and $u_2$
respectively. Then conservation of linear momentum implies that

$$m u = m u_1 \cos 30^\circ + m u_2 \cos 60^\circ,$$

$$0 = m u_1 \sin 30^\circ - m u_2 \sin 60^\circ,$$

which gives $u_1 = \tfrac{1}{2}\sqrt{3}\,u$ and $u_2 = \tfrac{1}{2}u$. Hence, after the collision, the electrons have
energies $\tfrac{3}{4}E$ and $\tfrac{1}{4}E$ respectively. ■
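As a check on Example 10.6, the two momentum equations can be solved as a 2 × 2 linear system. The sketch below is illustrative only: the function name is hypothetical, and it assumes (as the solution does) that the recoil direction is at right angles to the scattered direction.

```python
import math

def equal_mass_final_speeds(u, deflection_deg):
    """Solve the momentum equations for (u1, u2), equal masses, elastic collision."""
    a = math.radians(deflection_deg)   # scattering angle of the incoming particle
    b = math.pi / 2.0 - a              # recoil angle (opening angle is 90 degrees)
    # Momentum equations with the common mass cancelled:
    #   u = u1 cos a + u2 cos b
    #   0 = u1 sin a - u2 sin b
    det = -(math.cos(a) * math.sin(b) + math.cos(b) * math.sin(a))
    u1 = (-u * math.sin(b)) / det      # Cramer's rule
    u2 = (-u * math.sin(a)) / det
    return u1, u2

u1, u2 = equal_mass_final_speeds(1.0, 30.0)
# With equal masses and u = 1, the energy fractions are just the squared speeds.
fractions = (u1 ** 2, u2 ** 2)
print(u1, u2, fractions)
```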

10.7 COLLISION PROCESSES IN THE ZERO-MOMENTUM 
FRAME 

We have so far supposed that the inertial reference frame from which the scattering
process is observed is the one occupied by the experimental observer. This is called the
laboratory frame (or lab frame) since it is the frame in which measurements (of scattering
angles, for instance) are actually taken. In the lab frame, the target particle is initially
at rest.

FIGURE 10.4 A collision between two particles viewed from the zero-momentum frame.
The initial momenta $\boldsymbol{p}_1$, $\boldsymbol{p}_2$ are equal and opposite, as are the final momenta $\boldsymbol{p}_1'$, $\boldsymbol{p}_2'$. The
angle $\psi$ is the angle through which each of the masses is scattered.


However, it is very convenient to ‘view’ the scattering process from a different inertial
frame. Since the two particles form an isolated system, their centre of mass G moves with
constant velocity and so the frame* in which G is at rest is inertial. In this frame, the
total linear momentum of the two particles is zero and, for this reason, we call it the
zero-momentum frame† or ZM frame.

Consider, for example, the scattering problem which, in the lab frame, is shown in 
Figure 10.3. Then the total linear momentum $\boldsymbol{P} = m_1 \boldsymbol{u}$ and the velocity $\boldsymbol{V}$ of the centre
of mass of the two particles is

$$\boldsymbol{V} = \frac{m_1 \boldsymbol{u}}{m_1 + m_2}. \tag{10.16}$$


This therefore is the velocity of the ZM frame relative to the lab frame for this collision 
process. 
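The defining property of the ZM frame is easy to confirm numerically. The sketch below is an illustration under stated assumptions: velocities are plain `(x, y)` tuples, the target is at rest in the lab frame, and the function names are hypothetical.

```python
def zm_velocity(m1, m2, u):
    """Velocity V of the ZM frame from (10.16); u is the lab velocity of m1."""
    s = m1 / (m1 + m2)
    return (s * u[0], s * u[1])

def to_zm_frame(v, V):
    """Velocity seen from the ZM frame: subtract the frame velocity V."""
    return (v[0] - V[0], v[1] - V[1])

m1, m2 = 1.0, 3.0
u = (4.0, 0.0)                       # incident particle; target initially at rest
V = zm_velocity(m1, m2, u)
v1 = to_zm_frame(u, V)               # incident particle in the ZM frame
v2 = to_zm_frame((0.0, 0.0), V)      # target particle in the ZM frame
total = (m1 * v1[0] + m2 * v2[0], m1 * v1[1] + m2 * v2[1])
print(V, v1, v2, total)
```

The total momentum in the ZM frame comes out as zero, as it must.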

Collisions viewed from the ZM frame 

Two-particle collisions look simple when viewed from the ZM frame. This is because, 
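since the total linear momentum is now zero, the initial linear momenta $\boldsymbol{p}_1$, $\boldsymbol{p}_2$ of the two
particles and the final momenta $\boldsymbol{p}_1'$, $\boldsymbol{p}_2'$ of the two particles must satisfy

$$\boldsymbol{p}_1 + \boldsymbol{p}_2 = \boldsymbol{0}, \qquad \boldsymbol{p}_1' + \boldsymbol{p}_2' = \boldsymbol{0}. \tag{10.17}$$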

Thus, when a two-particle collision is viewed from the ZM frame, the initial momenta 
are equal and opposite and so are the final momenta. Figure 10.4 shows what a two- 
particle collision looks like when viewed from the ZM frame. Because of the relations 


* This frame has the same velocity as G, and no rotation, relative to the lab frame. 

† The term ‘centre of mass frame’ is also used. However, ‘zero-momentum frame’ is preferable since this
notion holds good in relativistic mechanics.

(10.17), the particles both arrive and depart in opposite directions, so that each particle is
deflected through the same angle $\psi$. All this follows solely from conservation of linear
momentum. We can say more if we also have an energy principle of the form

$$\tfrac{1}{2} m_1 |\boldsymbol{v}_1|^2 + \tfrac{1}{2} m_2 |\boldsymbol{v}_2|^2 + Q = \tfrac{1}{2} m_1 |\boldsymbol{v}_1'|^2 + \tfrac{1}{2} m_2 |\boldsymbol{v}_2'|^2,$$

where $\boldsymbol{v}_1$, $\boldsymbol{v}_2$, $\boldsymbol{v}_1'$, $\boldsymbol{v}_2'$ are the initial and final velocities of the particles (as shown in Figure
10.4), and $Q$ is the kinetic energy gained as a result of the collision.* Let $p$ be the common
magnitude of the initial momenta $\boldsymbol{p}_1$, $\boldsymbol{p}_2$, and $p'$ be the common magnitude of the final
momenta $\boldsymbol{p}_1'$, $\boldsymbol{p}_2'$. Then the energy balance equation can be re-written in the form

$$\frac{p^2}{2m_1} + \frac{p^2}{2m_2} + Q = \frac{p'^2}{2m_1} + \frac{p'^2}{2m_2},$$

that is,

$$p'^2 = p^2 + \frac{2 Q m_1 m_2}{m_1 + m_2}. \tag{10.18}$$


Thus the magnitudes of the initial and final momenta are related through Q , the energy 
gained in the collision. This is depicted in the momentum diagram in Figure 10.4. The 
magnitudes of the initial and final momenta ($p$ and $p'$) are represented by the radii of the
two dashed circles. The diagram shows the case in which p' > p, which corresponds to 
Q > 0. For an elastic collision, the circles are coincident and all four momenta have 
equal magnitudes. 

In a typical scattering problem, the masses $m_1$, $m_2$ and the initial momenta $\boldsymbol{p}_1$, $\boldsymbol{p}_2$ are
known. For the scattering problem shown (in the lab frame) in Figure 10.3, $\boldsymbol{v}_1 = \boldsymbol{u} - \boldsymbol{V}$
and $\boldsymbol{v}_2 = -\boldsymbol{V}$, where $\boldsymbol{V}$ is given by equation (10.16). It follows that the initial momentum
magnitudes in the ZM frame are given by

$$p = \frac{m_1 m_2 u}{m_1 + m_2}, \tag{10.19}$$


where $u = |\boldsymbol{u}|$. The scattering process is now entirely determined by the parameters $Q$
(the energy gain) and $\psi$ (the ZM scattering angle). Given $p$ and $Q$, $p'$ is determined
from equation (10.18). Together with $\psi$, this determines the final momenta $\boldsymbol{p}_1'$ and $\boldsymbol{p}_2'$.
The parameters $Q$ and $\psi$ depend on the physics of the actual collision. For instance, the
collision may be known to be elastic, in which case $Q = 0$. The question of how the
scattering angle $\psi$ is related to the actual interaction and initial conditions is addressed in
section 10.9.
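The relations (10.18) and (10.19) translate directly into code. The following sketch is illustrative only: the function names are hypothetical, and the error raised for a very negative $Q$ simply reflects the fact that $p'^2$ cannot be negative.

```python
import math

def zm_initial_momentum(m1, m2, u):
    """Common magnitude p of the initial ZM momenta, formula (10.19)."""
    return m1 * m2 * u / (m1 + m2)

def zm_final_momentum(m1, m2, u, Q):
    """Common magnitude p' of the final ZM momenta, formula (10.18)."""
    p = zm_initial_momentum(m1, m2, u)
    p_final_sq = p * p + 2.0 * Q * m1 * m2 / (m1 + m2)
    if p_final_sq < 0.0:
        raise ValueError("Q removes more kinetic energy than the ZM frame holds")
    return math.sqrt(p_final_sq)

def zm_kinetic_energy(m1, m2, p):
    """Total kinetic energy in the ZM frame when both momenta have magnitude p."""
    return p * p / (2.0 * m1) + p * p / (2.0 * m2)

m1, m2, u, Q = 1.0, 3.0, 4.0, 2.0
p = zm_initial_momentum(m1, m2, u)
p_final = zm_final_momentum(m1, m2, u, Q)
gain = zm_kinetic_energy(m1, m2, p_final) - zm_kinetic_energy(m1, m2, p)
print(p, p_final, gain)
```

The computed gain in ZM kinetic energy reproduces the value of $Q$ that was put in, and setting $Q = 0$ gives $p' = p$.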


Returning to the lab frame (elastic collisions only) 

Although the scattering process looks simpler in the ZM frame, we usually need to know 
the details of the scattering actually observed by the experimenter in the lab frame. This 


* As remarked earlier, $Q$ is frame independent and so is the same as the energy gain measured in the lab
frame.

FIGURE 10.5 The final particle velocities $\boldsymbol{u}_1$, $\boldsymbol{u}_2$ in the lab frame are obtained from the
final velocities $\boldsymbol{v}_1'$, $\boldsymbol{v}_2'$ in the ZM frame by the relations $\boldsymbol{u}_1 = \boldsymbol{v}_1' + \boldsymbol{V}$, $\boldsymbol{u}_2 = \boldsymbol{v}_2' + \boldsymbol{V}$. The
diagram shows the elastic case, in which the velocity triangle for $\boldsymbol{u}_2$ is isosceles.



entails transforming the properties of the final state (velocities, momenta and kinetic energies)
from the ZM frame back to the lab frame. Since the ZM frame has velocity $\boldsymbol{V}$ (given
by (10.16)) relative to the lab frame, the final velocities $\boldsymbol{u}_1$, $\boldsymbol{u}_2$ observed in the lab frame
are related to the final velocities $\boldsymbol{v}_1'$, $\boldsymbol{v}_2'$ in the ZM frame by

$$\boldsymbol{u}_1 = \boldsymbol{v}_1' + \boldsymbol{V}, \qquad \boldsymbol{u}_2 = \boldsymbol{v}_2' + \boldsymbol{V}. \tag{10.20}$$


Any other properties can then be found from $\boldsymbol{u}_1$, $\boldsymbol{u}_2$. The transformations (10.20) are
depicted geometrically in Figure 10.5. The transformation formulae become rather complicated
in the general case, but simplify nicely when the collision is elastic. From now on
we will restrict ourselves to elastic collisions only. In this case, $Q = 0$ and the collisions
are parametrised by $\psi$ alone. The energy equation (10.18) then implies that $p' = p$ so
that the four momentum magnitudes are equal, and given by equation (10.19). The final
speeds of the particles in the ZM frame are therefore given by

$$v_1' = \frac{m_2 u}{m_1 + m_2}, \qquad v_2' = \frac{m_1 u}{m_1 + m_2} = V, \tag{10.21}$$


where $V = |\boldsymbol{V}|$. We may now deduce the required information from Figure 10.5. The lab
scattering angle $\theta_1$ can be expressed in terms of the parameter angle $\psi$ by

$$\tan\theta_1 = \frac{v_1' \sin\psi}{v_1' \cos\psi + V} = \frac{\sin\psi}{\cos\psi + (V/v_1')} = \frac{\sin\psi}{\cos\psi + (m_1/m_2)},$$

on using equations (10.21). The lab recoil angle $\theta_2$ is easily found since, in an elastic
collision, the velocity triangle for $\boldsymbol{u}_2$ is isosceles with angles $\psi$, $\theta_2$ and $\theta_2$, as shown in
Figure 10.5. It follows that

$$\theta_2 = \tfrac{1}{2}(\pi - \psi).$$

The expression for the lab opening angle $\theta$ ($= \theta_1 + \theta_2$) is therefore given by

$$\tan\theta = \tan(\theta_1 + \theta_2) = \frac{\tan\theta_1 + \tan\theta_2}{1 - \tan\theta_1 \tan\theta_2} = \left(\frac{m_1 + m_2}{m_1 - m_2}\right) \cot(\tfrac{1}{2}\psi),$$

after some simplification.

To find the final energies, we observe that 


$$u_2 = 2V \sin(\tfrac{1}{2}\psi)$$

so that $E_2$, the final lab energy of the mass $m_2$, is given by

$$\frac{E_2}{E_0} = \frac{\tfrac{1}{2} m_2 \left(2V \sin(\tfrac{1}{2}\psi)\right)^2}{\tfrac{1}{2} m_1 u^2} = \frac{4 m_1 m_2}{(m_1 + m_2)^2} \sin^2(\tfrac{1}{2}\psi),$$

where $E_0$ ($= \tfrac{1}{2} m_1 u^2$) is the lab energy of the incident mass $m_1$. Since the collision is
elastic, the final lab energy of the mass $m_1$ is simply deduced from the energy conservation
formula $E_1 + E_2 = E_0$.

The above formulae give the properties of the final state following an elastic two-particle
collision in terms of the ZM scattering angle $\psi$. We will call them the elastic
collision formulae and they are summarised below:


Elastic collision formulae

A. $\tan\theta_1 = \dfrac{\sin\psi}{\cos\psi + \gamma}$        B. $\theta_2 = \tfrac{1}{2}(\pi - \psi)$

C. $\tan\theta = \left(\dfrac{\gamma + 1}{\gamma - 1}\right) \cot(\tfrac{1}{2}\psi)$        D. $\dfrac{E_2}{E_0} = \dfrac{4\gamma}{(\gamma + 1)^2} \sin^2(\tfrac{1}{2}\psi)$

(10.22)

$\psi$ is the scattering angle in the ZM frame, and $\gamma = m_1/m_2$, the mass ratio of the
two particles.
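The elastic collision formulae can be cross-checked against the direct vector construction of (10.20) and (10.21). The sketch below is illustrative, under these assumptions: the incident particle travels along the positive x-axis, $\psi$ is in radians with $0 < \psi \le \pi$, and the function names are hypothetical.

```python
import math

def elastic_collision(gamma, psi):
    """Formulae (10.22): return (theta1, theta2, opening angle, E2/E0)."""
    theta1 = math.atan2(math.sin(psi), math.cos(psi) + gamma)                  # A
    theta2 = 0.5 * (math.pi - psi)                                             # B
    e2_over_e0 = 4.0 * gamma / (gamma + 1.0) ** 2 * math.sin(0.5 * psi) ** 2   # D
    return theta1, theta2, theta1 + theta2, e2_over_e0

def lab_final_velocities(m1, m2, u, psi):
    """Direct construction: rotate the ZM velocities through psi, then add V."""
    V = m1 * u / (m1 + m2)        # speed of the ZM frame, from (10.16)
    v1 = m2 * u / (m1 + m2)       # ZM speed of m1, from (10.21)
    v2 = V                        # ZM speed of m2, from (10.21)
    u1 = (v1 * math.cos(psi) + V, v1 * math.sin(psi))
    u2 = (-v2 * math.cos(psi) + V, -v2 * math.sin(psi))
    return u1, u2

m1, m2, u, psi = 1.0, 2.0, 1.0, math.radians(120.0)
theta1, theta2, theta, e2 = elastic_collision(m1 / m2, psi)
u1, u2 = lab_final_velocities(m1, m2, u, psi)
print(math.degrees(theta1), math.degrees(theta2), e2)
```

Using `atan2` in formula A keeps $\theta_1$ in the correct quadrant when $\cos\psi + \gamma$ is negative, which a plain arctangent would not.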


Using the elastic collision formulae A word of advice about the use of these formulae may 
be helpful. Most questions on this topic tell you some property of the scattering in the lab 
frame and ask you to find another property of the scattering in the lab frame; the ZM frame is 
never mentioned. It is inadvisable to start manipulating the elastic scattering formulae. This 
is almost guaranteed to cause errors. The simplest method is as follows: (i) Use the given data 
to find $\psi$ by using the appropriate formula ‘backwards’, and then (ii) use this value of $\psi$ to
find the required scattering property. In short, the advice is ‘go via $\psi$’.




Example 10.7 Using the elastic scattering formulae 

In an experiment, particles of mass $m$ and energy $E$ are used to bombard stationary
target particles of mass $2m$.

Q. The experimenters wish to select particles that, after scattering, have energy $E/3$.
At what scattering angle will they find such particles?

A. If $E_1/E_0 = 1/3$, then by energy conservation $E_2/E_0 = 2/3$. First use formula D
to find $\psi$. Since the mass ratio $\gamma = 1/2$, this gives

$$\sin^2(\tfrac{1}{2}\psi) = \tfrac{3}{4}$$

so that $\psi = 120°$. Now use formula A to find the scattering angle $\theta_1$. This gives
$\tan\theta_1 = \infty$ so that $\theta_1 = 90°$. Particles scattered with energy $E/3$ will therefore be
found emerging at right angles to the incident beam.

Q. In one collision, the opening angle was measured to be 135°. What were the individual
scattering and recoil angles?

A. First use formula C to find $\psi$. Since $\gamma = 1/2$, this gives

$$\cot(\tfrac{1}{2}\psi) = \tfrac{1}{3}$$

so that $\tfrac{1}{2}\psi = 72°$, to the nearest degree. Now use formula B to find the recoil angle
$\theta_2$. This gives $\theta_2 = 90° - 72° = 18°$. The scattering angle $\theta_1$ must therefore be
$\theta_1 = \theta - \theta_2 = 135° - 18° = 117°$.

Q. In another collision, the scattering angle was measured to be 45°. What was the 
recoil angle? 

A. First use formula A to find $\psi$. This shows that $\psi$ satisfies the equation

$$2 \sin\psi - 2 \cos\psi = 1,$$

which can be written in the form*

$$\sqrt{8}\, \cos(\psi - 135°) = 1.$$

This gives $\psi = 66°$, to the nearest degree. Formula B now gives the recoil angle $\theta_2$
to be $\theta_2 = 57°$, to the nearest degree.
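The ‘go via $\psi$’ advice can be followed mechanically. As a minimal sketch (with hypothetical function names), the first question of Example 10.7 is worked below by inverting formula D for $\psi$ and then applying formula A; angles are handled in radians internally.

```python
import math

def psi_from_energy_fraction(gamma, e2_over_e0):
    """Invert formula D of (10.22) for the ZM scattering angle psi."""
    s_sq = e2_over_e0 * (gamma + 1.0) ** 2 / (4.0 * gamma)   # sin^2(psi / 2)
    if not 0.0 <= s_sq <= 1.0:
        raise ValueError("this energy fraction cannot occur for this mass ratio")
    return 2.0 * math.asin(math.sqrt(s_sq))

def scattering_angle(gamma, psi):
    """Formula A of (10.22): lab scattering angle theta1."""
    return math.atan2(math.sin(psi), math.cos(psi) + gamma)

gamma = 0.5                      # projectile mass m, target mass 2m
e2_over_e0 = 1.0 - 1.0 / 3.0     # scattered particle keeps E/3
psi = psi_from_energy_fraction(gamma, e2_over_e0)
theta1 = scattering_angle(gamma, psi)
print(math.degrees(psi), math.degrees(theta1))
```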

10.8 THE TWO-BODY PROBLEM 

The problem of determining the motion of two particles, moving solely under 
their mutual interaction, is called the two-body problem. Strictly speaking, all of the orbit 


* Recall that equations of the form $a \cos\psi + b \sin\psi = c$ are solved by writing the left side in the ‘polar
form’ $R \cos(\psi - \alpha)$, where $R^2 = a^2 + b^2$ and $\tan\alpha = b/a$.

FIGURE 10.6 The motion of $P_1$ and $P_2$ relative to their centre of mass (left), and the motion of $P_1$
relative to $P_2$ (right).

problems considered in Chapter 7 should have been treated as two-body problems since 
centres of force are never actually fixed. The one-body theory is a good approximation 
when one particle is much more massive than the other. When the two particles have 
similar masses, the problem must be treated by two-body theory, in which neither particle 
is assumed to be fixed. 

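Let $P_1$ and $P_2$ be two particles moving under their mutual interaction. By the Third
Law, the forces that they exert on each other are equal in magnitude, opposite in direction,
and act along the line joining them. We will further suppose that the magnitude of these
interaction forces depends only on $r$, the distance separating $P_1$ from $P_2$. The forces $\boldsymbol{F}_1$,
$\boldsymbol{F}_2$, acting on $P_1$, $P_2$, then have the form

$$\boldsymbol{F}_1 = F(r)\, \hat{\boldsymbol{r}}, \qquad \boldsymbol{F}_2 = -F(r)\, \hat{\boldsymbol{r}},$$

where $\boldsymbol{r} = \boldsymbol{r}_1 - \boldsymbol{r}_2$, $r = |\boldsymbol{r}_1 - \boldsymbol{r}_2|$ and $\hat{\boldsymbol{r}} = \boldsymbol{r}/r$ (see Figure 10.6). The equations of
motion for $P_1$, $P_2$ are therefore

$$m_1 \ddot{\boldsymbol{r}}_1 = F(r)\, \hat{\boldsymbol{r}}, \qquad m_2 \ddot{\boldsymbol{r}}_2 = -F(r)\, \hat{\boldsymbol{r}}. \tag{10.23}$$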


This is a generalisation of central force motion in which each particle moves under a force 
centred upon the other particle. Although this problem appears to be complicated, it can 
be quickly reduced to an equivalent one-body problem.


We first observe that the two particles form an isolated system so that their total linear
momentum is conserved, or (equivalently) their centre of mass G moves with constant
velocity. The motion of G is therefore determined from the initial conditions and it
remains to find the motion of each particle relative to G, that is, their motions in the ZM
frame. It turns out however that it is easier to find the motion of one particle relative to
the other. The motion of each particle relative to G can then be easily deduced.

The equation of relative motion 

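It follows from the equations of motion (10.23) that

$$\ddot{\boldsymbol{r}}_1 - \ddot{\boldsymbol{r}}_2 = \left(\frac{1}{m_1} + \frac{1}{m_2}\right) F(r)\, \hat{\boldsymbol{r}} = \left(\frac{m_1 + m_2}{m_1 m_2}\right) F(r)\, \hat{\boldsymbol{r}},$$

so that $\boldsymbol{r}$, the position vector of $P_1$ relative to $P_2$, satisfies the equation


Relative motion equation

$$\left(\frac{m_1 m_2}{m_1 + m_2}\right) \ddot{\boldsymbol{r}} = F(r)\, \hat{\boldsymbol{r}}, \tag{10.24}$$

which we call the relative motion equation.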

Definition 10.3 Reduced mass The quantity $\mu$, defined by

$$\mu = \frac{m_1 m_2}{m_1 + m_2}, \tag{10.25}$$

is called the reduced mass.

Our result can be expressed as follows:


Two-body problem - the relative motion 

In the two-body problem, the motion of $P_1$ relative to $P_2$ is the same as if $P_2$ were
held fixed and $P_1$ had the reduced mass $\mu$ instead of its actual mass $m_1$.


This rule* allows us to replace the problem of the motion of $P_1$ relative to $P_2$ by an
equivalent one-body problem in which $P_2$ is fixed. The solution of such problems is
fully described in Chapter 7.
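The content of the rule can be illustrated in a few lines. The sketch below is only a consistency check under simple assumptions: accelerations are taken as components along $\hat{\boldsymbol{r}}$, the force magnitude is an arbitrary number, and the function names are hypothetical.

```python
def reduced_mass(m1, m2):
    """Reduced mass, formula (10.25)."""
    return m1 * m2 / (m1 + m2)

def relative_acceleration(m1, m2, force):
    """Component of (r1'' - r2'') along r-hat, taken directly from (10.23)."""
    a1 = force / m1        # acceleration of P1
    a2 = -force / m2       # acceleration of P2 (equal and opposite force)
    return a1 - a2

m1, m2, force = 2.0, 3.0, 5.0
mu = reduced_mass(m1, m2)
# The relative acceleration is what a single particle of mass mu would have.
print(relative_acceleration(m1, m2, force), force / mu)
# When P2 is very massive, mu approaches m1 and one-body theory is recovered.
print(reduced_mass(m1, 1.0e12))
```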

Example 10.8 Escape from a free gravitating body 

Two particles $P_1$ and $P_2$, with masses $m_1$ and $m_2$, can move freely under their mutual
gravitation. Initially both particles are at rest and separated by a distance $c$. With
what speed must $P_1$ be projected so as to escape from $P_2$?

Solution 

Since this is a mutual gravitation problem, we take our rule in the form: The motion
of $P_1$ relative to $P_2$ is the same as if $P_2$ were held fixed and the constant of gravitation
$G$ replaced by $G'$, where

$$G' = \left(\frac{m_1 + m_2}{m_2}\right) G.$$


* The rule is ambiguous when the force $F$ also depends on $m_1$, as in mutual gravitation. Do you also
replace this $m_1$ by $\mu$? The answer is no, but the easiest way to avoid this glitch in the mutual gravitation
problem is to make the transformation $G \to (m_1 + m_2) G / m_2$ instead. This has the correct effect and is
not ambiguous.

FIGURE 10.7 Particles $P_1$ and $P_2$ move under their mutual gravitation. In the zero-momentum
frame, the orbits are similar conics, each with a focus at G (left). The orbit of
$P_1$ relative to $P_2$ is a third similar conic with $P_2$ at a focus (right).


From the one-body theory in Chapter 7, we know that $P_1$ will escape from a fixed $P_2$
if it has positive energy, that is if

$$\tfrac{1}{2} m_1 V^2 - \frac{m_1 m_2 G}{c} > 0.$$

Hence, when $P_2$ is not fixed, $P_1$ will escape if

$$\tfrac{1}{2} m_1 V^2 - \frac{m_1 m_2 G'}{c} > 0,$$

that is, if

$$V^2 > \frac{2 (m_1 + m_2) G}{c}.$$

This is the required escape condition. ■
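The escape condition is easily evaluated, and comparing it with the fixed-centre result shows the size of the two-body correction. In the sketch below the function name and the `fixed_target` flag are hypothetical, and the default SI value of $G$ is an assumption supplied for convenience; any consistent system of units may be used.

```python
import math

def escape_speed(m1, m2, c, G=6.674e-11, fixed_target=False):
    """Threshold projection speed for P1 to escape from P2 at separation c.

    fixed_target=True gives the one-body result (P2 held fixed);
    otherwise G is effectively replaced by G' = (m1 + m2) G / m2.
    """
    attracting_mass = m2 if fixed_target else m1 + m2
    return math.sqrt(2.0 * G * attracting_mass / c)

m1, m2, c = 1.0, 4.0, 2.0
v_free = escape_speed(m1, m2, c, G=1.0)
v_fixed = escape_speed(m1, m2, c, G=1.0, fixed_target=True)
print(v_free, v_fixed, v_free / v_fixed)
```

The ratio of the two speeds is $(1 + m_1/m_2)^{1/2}$, so the correction is small when $P_2$ is much the more massive particle.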

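Once the relative motion of the particles has been found, one may easily deduce the
motion of each particle in the ZM frame since (see Figure 10.6)

$$\boldsymbol{r}_1^{G} = \left(\frac{m_2}{m_1 + m_2}\right) \boldsymbol{r}, \qquad \boldsymbol{r}_2^{G} = -\left(\frac{m_1}{m_1 + m_2}\right) \boldsymbol{r}.$$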


It follows that the orbits of $P_1$, $P_2$ in the ZM frame are geometrically similar to the orbit
in the relative motion. For instance, suppose that the mutual interaction of $P_1$ and $P_2$ is
gravitational attraction, and that the orbit of $P_1$ relative to $P_2$ has been found to be an
ellipse. Then the orbits of $P_1$ and $P_2$ in the ZM frame are similar ellipses, as shown in
Figure 10.7. The ratio of the major axes of these orbits is $m_2 : m_1$, and the sum of their
major axes is equal to the major axis of the orbit of $P_1$ relative to $P_2$. All three orbits have
the same period $\tau$ given by

$$\tau^2 = \frac{4\pi^2 a^3}{G (m_1 + m_2)}, \tag{10.26}$$

where $a$ is the semi-major axis of the relative orbit. This formula is simply obtained from
the one-body period formula (7.26) by replacing $G$ by $G'$.

Formula (10.26) shows that, in the approximate treatment in which $P_2$ is regarded
as fixed, the value of the period is overestimated by the factor

$$\left(1 + \frac{m_1}{m_2}\right)^{1/2},$$

which is a small correction when $m_1/m_2$ is small. In the Solar system, the largest value
of $m_1/m_2$ for a planetary orbit is that for Jupiter, which is about 1/1000.
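The size of this correction for Jupiter can be estimated numerically. The sketch below is illustrative: it works in astronomical units (AU, solar masses, years), in which $G = 4\pi^2$; the semi-major axis of 5.2 AU is an assumed round value for Jupiter's orbit; the function names are hypothetical.

```python
import math

G_AU = 4.0 * math.pi ** 2    # G in units of AU^3 / (solar mass * year^2)

def orbital_period(a, m1, m2, G=G_AU):
    """Two-body period from formula (10.26)."""
    return 2.0 * math.pi * math.sqrt(a ** 3 / (G * (m1 + m2)))

def fixed_centre_period(a, m2, G=G_AU):
    """One-body period, with P2 regarded as fixed."""
    return 2.0 * math.pi * math.sqrt(a ** 3 / (G * m2))

a_jupiter = 5.2              # AU, assumed illustrative value
mass_ratio = 1.0e-3          # m1/m2 for Jupiter and the Sun, as quoted in the text
tau_two_body = orbital_period(a_jupiter, mass_ratio, 1.0)
tau_one_body = fixed_centre_period(a_jupiter, 1.0)
print(tau_one_body / tau_two_body)   # equals (1 + m1/m2)^(1/2)
```

For a mass ratio of 1/1000 the factor is about 1.0005, so the fixed-centre treatment overestimates the period by roughly 0.05%.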

Binary stars 

It is probable that over half of the ‘stars’ in our galaxy are not single stars, like the Sun, 
but occur in pairs* that move under their mutual gravitation. Such a pair is called a binary 
star. 

Binary stars are important in astronomy and also provide a nice application of our 
two-body theory. In particular, the two components of the binary must orbit their centre of 
mass on similar ellipses, as shown in Figure 10.7; the orbit of either component relative to 
the other is a third similar ellipse; and the period of all three motions is given by formula 
(10.26), where a is the semi-major axis of the relative orbit. 

One reason why binary stars are important in astronomy is that the masses of their 
component stars can be found by direct measurement; indeed they are the only stars for 
which this can be done. Suppose that the star is an optical binary, which means that both 
components are visible through a suitably large telescope. Then the period of the binary 
can be measured by direct observation. It is also possible to measure the major axis of the 
relative orbit. Once $\tau$ and $a$ are known, formula (10.26) tells us the sum of the masses of
the two components of the binary. 

Example 10.9 Sirius A and B 

A typical example of a binary is Sirius in the constellation Canis Major, the brightest 
star in the night sky. The large bright component is called Sirius A and its small dim 
companion Sirius B. The period of their mutual orbital motion is 50 years and the 
value of a is 20 AU. (This is about the distance from the Sun to the planet Uranus.) 
Find the sum of the masses of the two components of Sirius. 

Solution 

In terms of astronomical units, in which $G = 4\pi^2$, formula (10.26) gives

$$M_A + M_B = \frac{a^3}{\tau^2} = \frac{20^3}{50^2} = 3.2\, M_\odot. \quad ■$$


* Groups of three or more also occur.

In order to determine the individual masses of the components by optical means, it is
necessary to find the value of $a$ for one of the individual components in its motion relative to
the centre of mass. The procedure is essentially the same as before, but much more difficult
observationally since the motion of the chosen component must be measured absolutely,
that is, relative to background stars. In the case of Sirius, it is found that $M_A = 2.1\, M_\odot$
and $M_B = 1.1\, M_\odot$.
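The mass determination for a binary can be sketched as follows. The total mass uses formula (10.26) in astronomical units; the split into individual masses uses the fact that each component's orbit about the centre of mass is the relative orbit scaled by the other component's mass fraction. The function names are hypothetical, and the value 6.875 AU for the semi-major axis of Sirius A about the centre of mass is a made-up illustrative input, chosen to be consistent with the quoted masses, not a measured value.

```python
def total_mass_solar(a_au, period_years):
    """Sum of the component masses in solar masses, from (10.26) with G = 4 pi^2."""
    return a_au ** 3 / period_years ** 2

def component_masses(a_au, period_years, a_A_au):
    """Individual masses (M_A, M_B) in solar masses.

    a_A_au is the semi-major axis of A's orbit about the centre of mass,
    which satisfies a_A = (M_B / (M_A + M_B)) * a.
    """
    total = total_mass_solar(a_au, period_years)
    mass_B = total * a_A_au / a_au
    return total - mass_B, mass_B

total = total_mass_solar(20.0, 50.0)
mass_A, mass_B = component_masses(20.0, 50.0, 6.875)   # 6.875 AU is hypothetical
print(total, mass_A, mass_B)
```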


10.9 TWO-BODY SCATTERING 

An important application of two-body motion is the two-body scattering problem.
In our treatment of collision theory, we considered the whole class of possible collisions
between two particles that were consistent with momentum and energy conservation.
These collisions were parametrised by the ZM scattering angle $\psi$. We now consider the
problem in more detail. Given the interaction between the particles and the impact parameter
$p$, what is the resulting ZM scattering angle? This question can be answered by using
two-body theory. We break up the process into a number of steps:

1. Find the $\{p, \theta\}$-relation for the one-body problem

First consider the one-body problem in which the particle $P_2$ is held fixed, and work
out (or look up) the relation between the impact parameter $p$ and the scattering angle $\theta$.
For example, the $\{p, \theta\}$-relation for Rutherford scattering was derived in Chapter 7 and
was found to be

$$\tan\tfrac{1}{2}\theta = \frac{q_1 q_2}{m_1 p u^2}. \tag{10.27}$$


2. Find the $\{p, \phi\}$-relation for the relative motion problem

The next step is to find the relation between the impact parameter $p$ and the scattering
angle $\phi$ observed in the relative motion problem. This is easily obtained from the one-body
formula (10.27) by replacing $m_1$ by $\mu$ (the reduced mass) and replacing $\theta$ by $\phi$. This
gives

$$\tan\tfrac{1}{2}\phi = \frac{q_1 q_2 (1 + \gamma)}{m_1 p u^2}, \tag{10.28}$$

where $\gamma$ ($= m_1/m_2$) is the ratio of the two masses.


3. Find the $\{p, \psi\}$-relation observed in the ZM frame

The angle $\phi$ that appears in the formula (10.28) is the scattering angle in the motion
of $m_1$ relative to $m_2$. However, by an amazing stroke of good fortune, it is actually the
same angle as the ZM scattering angle $\psi$ that we used in collision theory.* Hence, the
$\{p, \psi\}$-relation when two-body scattering is observed from the ZM frame is obtained
by simply replacing $\phi$ in formula (10.28) by $\psi$, that is,

$$\tan\tfrac{1}{2}\psi = \frac{q_1 q_2 (1 + \gamma)}{m_1 p u^2}. \tag{10.29}$$

As always, $u$ means the speed of the incident particle observed in the lab frame.


* The reason is as follows: The relative motion in the lab frame must be the same as the relative motion in
the ZM frame. In this frame, the initial relative velocity of $P_1$ is equal to $(\boldsymbol{p}_1/m_1) - (\boldsymbol{p}_2/m_2)$, which
has the same direction as $\boldsymbol{p}_1$. Likewise, the final relative velocity of $P_1$ is equal to $(\boldsymbol{p}_1'/m_1) - (\boldsymbol{p}_2'/m_2)$,
which has the same direction as $\boldsymbol{p}_1'$. Hence the scattering angle in the relative motion is the same as that
in the ZM frame.

4. Find $\theta_1$ and $\theta_2$ in terms of $p$ from the elastic collision formulae

Since the $\{p, \psi\}$-relation (10.29) gives the ZM scattering angle $\psi$ in terms of $p$, this
expression for $\psi$ can now be substituted into the elastic scattering formulae (10.22A) and
(10.22B) to give expressions for the two-body scattering angle $\theta_1$, and recoil angle $\theta_2$ in
terms of $p$. For Rutherford scattering, this gives, after some simplification,


Two-body Rutherford scattering formulae

$$\tan\theta_1 = \frac{4 q_1 q_2\, p E}{4 p^2 E^2 - (1 - \gamma^2)\, q_1^2 q_2^2}, \qquad \tan\theta_2 = \frac{2 p E}{q_1 q_2 (1 + \gamma)}, \tag{10.30}$$

where $E$ ($= \tfrac{1}{2} m_1 u^2$) is the energy of the incident particle and $\gamma = m_1/m_2$.

These formulae simplify further when the particles have equal masses. In this special
case, $\gamma = 1$ and the scattering and recoil angles are given by

$$\tan\theta_1 = \frac{q_1 q_2}{p E}, \qquad \tan\theta_2 = \frac{p E}{q_1 q_2}. \tag{10.31}$$

(As expected, $\theta_1 + \theta_2 = \tfrac{1}{2}\pi$.) These formulae would apply, for example, to the scattering
of alpha particles by helium nuclei.
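The four steps above can be followed numerically and compared with the closed forms (10.30). The sketch below is an illustration under stated assumptions: the product $q_1 q_2$ is passed as a single positive number `k` (repulsive case, in the units used in the text), angles are in radians, and the function names are hypothetical.

```python
import math

def rutherford_angles_via_psi(k, p, E, gamma):
    """Go via psi: (10.29) with m1 u^2 = 2E, then formulae (10.22A) and (10.22B)."""
    psi = 2.0 * math.atan(k * (1.0 + gamma) / (2.0 * p * E))
    theta1 = math.atan2(math.sin(psi), math.cos(psi) + gamma)
    theta2 = 0.5 * (math.pi - psi)
    return theta1, theta2

def rutherford_angles_closed_form(k, p, E, gamma):
    """The two-body Rutherford scattering formulae (10.30)."""
    numerator = 4.0 * k * p * E
    denominator = 4.0 * p * p * E * E - (1.0 - gamma ** 2) * k * k
    theta1 = math.atan2(numerator, denominator)   # picks the branch in (0, pi)
    theta2 = math.atan(2.0 * p * E / (k * (1.0 + gamma)))
    return theta1, theta2

via_psi = rutherford_angles_via_psi(1.0, 1.0, 1.0, 0.5)
closed = rutherford_angles_closed_form(1.0, 1.0, 1.0, 0.5)
equal_mass = rutherford_angles_closed_form(1.0, 0.7, 1.3, 1.0)
print(via_psi, closed, sum(equal_mass))
```

For equal masses the two angles returned add up to a right angle, in agreement with (10.31).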

Two-body scattering cross section 

Having found the $\{p, \theta_1\}$-relation for the two-body scattering problem, the two-body
scattering cross section $\sigma^{TB}$ is given, in principle, by the formula

$$\sigma^{TB}(\theta_1) = -\frac{p}{\sin\theta_1} \frac{dp}{d\theta_1}.$$

However, this requires that the $\{p, \theta_1\}$-relation be solved to give $p$ as a function of $\theta_1$ and
the resulting algebra is formidable.

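The following method has the advantage that $\sigma^{TB}$ is determined directly from the corresponding
one-body scattering cross section. The trick is to introduce the ZM scattering
angle $\psi$. By the chain rule,

$$\frac{dp}{d\theta_1} = \frac{dp}{d\psi} \times \frac{d\psi}{d\theta_1},$$

and so $\sigma^{TB}$ can be written

$$\sigma^{TB} = -\frac{p}{\sin\theta_1} \frac{dp}{d\theta_1} = -\frac{p}{\sin\theta_1} \left(\frac{dp}{d\psi} \times \frac{d\psi}{d\theta_1}\right) = \left(\frac{\sin\psi}{\sin\theta_1}\right) \left(\frac{d\psi}{d\theta_1}\right) \left(-\frac{p}{\sin\psi} \frac{dp}{d\psi}\right) = \left(\frac{\sin\psi}{\sin\theta_1}\right) \left(\frac{d\psi}{d\theta_1}\right) \sigma^{ZM}(\psi),$$

where $\sigma^{ZM}$ is defined by

$$\sigma^{ZM}(\psi) = -\frac{p}{\sin\psi} \frac{dp}{d\psi}. \tag{10.32}$$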


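Now $\sigma^{ZM}(\psi)$ is easily obtained from the one-body cross-section $\sigma(\theta)$ by replacing $m_1$
by $\mu$ and $\theta$ by $\psi$. The two-body cross section is then given by


Two-body scattering cross section

$$\sigma^{TB}(\theta_1) = \left(\frac{\sin\psi}{\sin\theta_1}\right) \left(\frac{d\psi}{d\theta_1}\right) \sigma^{ZM}(\psi) \tag{10.33}$$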


In this formula, we have yet to replace $\psi$ by its expression in terms of $\theta_1$. To do this, we
must invert the formula (10.22A) to obtain $\psi$ as a function of $\theta_1$. Formula (10.22A) can
be rearranged in the form

$$\sin(\psi - \theta_1) = \gamma \sin\theta_1,$$

from which we obtain

$$\psi = \theta_1 + \sin^{-1}(\gamma \sin\theta_1) \tag{10.34}$$

and, by differentiation,

$$\frac{d\psi}{d\theta_1} = 1 + \frac{\gamma \cos\theta_1}{(1 - \gamma^2 \sin^2\theta_1)^{1/2}}. \tag{10.35}$$

These expressions for $\psi$ and $d\psi/d\theta_1$ in terms of $\theta_1$ must now be substituted into equation
(10.33) to obtain the final formula for the two-body scattering cross section $\sigma^{TB}(\theta_1)$.
These operations can be done with computer assistance.
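One possible form of such computer assistance is sketched below. It evaluates the general formula (10.33) numerically for any supplied ZM cross section, with $\psi = \theta_1 + \sin^{-1}(\gamma\sin\theta_1)$ and $d\psi/d\theta_1 = 1 + \gamma\cos\theta_1/(1 - \gamma^2\sin^2\theta_1)^{1/2}$. The function names are hypothetical, and the sketch restricts itself to $\gamma \le 1$ (an assumption made here so that the inverse sine is always defined).

```python
import math

def psi_of_theta1(gamma, theta1):
    """ZM scattering angle as a function of the lab angle theta1."""
    x = gamma * math.sin(theta1)
    if abs(x) > 1.0:
        raise ValueError("gamma * sin(theta1) exceeds 1; this sketch assumes gamma <= 1")
    return theta1 + math.asin(x)

def dpsi_dtheta1(gamma, theta1):
    """Derivative of psi with respect to theta1."""
    s = gamma * math.sin(theta1)
    return 1.0 + gamma * math.cos(theta1) / math.sqrt(1.0 - s * s)

def two_body_cross_section(sigma_zm, gamma, theta1):
    """General formula (10.33); sigma_zm is a function of psi."""
    psi = psi_of_theta1(gamma, theta1)
    return (math.sin(psi) / math.sin(theta1)) * dpsi_dtheta1(gamma, theta1) * sigma_zm(psi)

# Equal masses with a constant ZM cross section: the result should be 4 cos(theta1).
value = two_body_cross_section(lambda psi: 1.0, 1.0, 0.3)
print(value, 4.0 * math.cos(0.3))
```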

For example, in Rutherford scattering, we first obtain $\sigma^{ZM}(\psi)$ by replacing $m_1$ by
$\mu$ (and $\theta$ by $\psi$) in the one-body cross section formula (7.37) obtained in Chapter 7. This
gives

$$\sigma^{ZM}(\psi) = \frac{q_1^2 q_2^2 (1 + \gamma)^2}{4 m_1^2 u^4} \left(\frac{1}{\sin^4 \tfrac{1}{2}\psi}\right). \tag{10.36}$$

FIGURE 10.8 The Rutherford two-body scattering cross section $\sigma^{TB}$ plotted against
the scattering angle $\theta_1$ ($\pi/2 < \theta_1 < \pi$) for various values of the mass ratio
$\gamma$ ($= m_1/m_2$). $E$ is the kinetic energy of the incident particles.



The two-body Rutherford scattering cross section is now obtained by substituting the
expression (10.36) into the general formula (10.33) and then replacing $\psi$ and $d\psi/d\theta_1$ by
the expressions (10.34), (10.35). After much manipulation, the answer is found to be

$$\sigma^{TB} = \left(\frac{q_1^2 q_2^2}{16 E^2}\right) \left(\frac{4 (1 + \gamma)^2 (\gamma \cos\theta_1 + S)^2}{S\, (1 + \gamma \sin^2\theta_1 - \cos\theta_1\, S)^2}\right), \tag{10.37}$$

where

$$S = \left(1 - \gamma^2 \sin^2\theta_1\right)^{1/2}$$

and $E$ ($= \tfrac{1}{2} m_1 u^2$) is the energy of the incident particle.

Figure 10.8 shows graphs of $\sigma^{TB}(\theta_1)$ in Rutherford scattering for various choices of
the mass ratio $\gamma$. In Rutherford’s actual experiment with alpha particles and gold nuclei,
the value of $\gamma$ was about 0.02 and the error in the scattering cross section caused by using
the one-body theory was less than 0.1%. However, as the graphs show, larger values of $\gamma$
can give rise to a substantial deviation from the one-body theory.

When the mass ratio $\gamma$ ($= m_1/m_2$) is small, the formula (10.37) is approximated by

$$\sigma^{TB} = \frac{q_1^2 q_2^2}{16 E^2} \left(\frac{1}{\sin^4(\theta_1/2)} - 2\gamma^2 + O(\gamma^4)\right).$$

Thus, when $\gamma$ is small, the leading correction to the one-body approximation is a constant.
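These statements can be tested numerically. The sketch below works in units of $q_1^2 q_2^2 / (16E^2)$ and compares three things: the closed form (10.37), the general formula (10.33) with the Rutherford ZM cross section (10.36) substituted, and the small-$\gamma$ approximation. The function names are hypothetical, and $\gamma \le 1$ is assumed.

```python
import math

def sigma_tb_closed_form(gamma, theta1):
    """Formula (10.37), in units of q1^2 q2^2 / (16 E^2)."""
    S = math.sqrt(1.0 - (gamma * math.sin(theta1)) ** 2)
    numerator = 4.0 * (1.0 + gamma) ** 2 * (gamma * math.cos(theta1) + S) ** 2
    denominator = S * (1.0 + gamma * math.sin(theta1) ** 2 - math.cos(theta1) * S) ** 2
    return numerator / denominator

def sigma_tb_from_general_formula(gamma, theta1):
    """Formula (10.33) with (10.36), using psi and its derivative in terms of theta1."""
    s = gamma * math.sin(theta1)
    psi = theta1 + math.asin(s)
    dpsi = 1.0 + gamma * math.cos(theta1) / math.sqrt(1.0 - s * s)
    sigma_zm = (1.0 + gamma) ** 2 / math.sin(0.5 * psi) ** 4   # same units
    return (math.sin(psi) / math.sin(theta1)) * dpsi * sigma_zm

def sigma_tb_small_gamma(gamma, theta1):
    """One-body cross section plus the leading correction -2 gamma^2."""
    return 1.0 / math.sin(0.5 * theta1) ** 4 - 2.0 * gamma ** 2

closed = sigma_tb_closed_form(0.5, 2.0)
general = sigma_tb_from_general_formula(0.5, 2.0)
# Backward scattering with gamma = 0.02: relative error of the one-body theory.
relative_error = abs(1.0 - sigma_tb_closed_form(0.02, math.pi))
print(closed, general, relative_error)
```

At $\theta_1 = \pi$ with $\gamma = 0.02$ the relative error of the one-body result is about 0.08%, consistent with the figure of less than 0.1% quoted above.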


Equal masses 

The whole process of finding $\sigma^{TB}$ simplifies wonderfully when the two particles have
equal masses. In this case, $\psi = 2\theta_1$, $d\psi/d\theta_1 = 2$, and the general formula (10.33)
becomes

$$\sigma^{TB}(\theta_1) = 4 \cos\theta_1\, \sigma^{ZM}(2\theta_1) \qquad (0 < \theta_1 < \pi/2).$$

For example, in Rutherford scattering where the particles have equal masses, $\sigma^{TB}$ has the
simple form

$$\sigma^{TB}(\theta_1) = \frac{q_1^2 q_2^2}{E^2} \left(\frac{\cos\theta_1}{\sin^4\theta_1}\right) \qquad (0 < \theta_1 < \pi/2).$$

This formula would apply, for example, to the scattering of protons by protons.


10.10 INTEGRABLE MECHANICAL SYSTEMS 

A mechanical system is said to be integrable if its equations of motion are soluble
in the sense that they can be reduced to integrations.* The most important class
of integrable systems consists of those that satisfy as many conservation principles as they have
degrees of freedom. Suppose that a mechanical system S has n degrees of freedom and
that it satisfies n conservation principles. Then it is certainly true that the n conservation
equations are sufficient to determine the motion of the system, in the sense that no more
equations are needed. More importantly though, it can be shown† that these equations can
always be reduced to integrations. The system S is therefore integrable.

Before we can apply this method to particular systems, there is a kinematical problem
to be overcome, namely: how does one find the velocities (and angular velocities)
of the elements‡ of S when there are two or more generalised coordinates which vary
simultaneously? The answer is by drawing a velocity diagram for S as described below:


Drawing a velocity diagram 

• Draw the system in general position and select a set of generalised coordinates. 

• Let the first generalised coordinate vary (with the other coordinates held constant) 
and mark in the velocity of each element. 

• Now let the second generalised coordinate vary (with the other coordinates held
constant) and, on the same diagram, mark in the velocity of each element. Continue
in this way through all the generalised coordinates.

• Then, when all the generalised coordinates are varying simultaneously, the velocity
of each element of S is the vector sum of the velocities given to that element
when the coordinates vary individually.

In the above, ‘velocity’ means ‘velocity and/or angular velocity’. 


* The system is still said to be integrable even when the integrals cannot be evaluated in terms of standard 
functions! 

† This is Liouville’s theorem on integrable systems (see Problem 14.15).

‡ The elements of S are the particles and/or rigid bodies of which S is made up. One needs to find (i) the
velocity of each particle, (ii) the velocity of the centre of mass of each rigid body, and (iii) the angular
velocity of each rigid body, in each case in terms of the chosen coordinates and their time derivatives.

FIGURE 10.9 Constructing a velocity diagram. Figure (a) shows the system and the
coordinates $x$ and $\theta$. Figure (b) shows the velocities generated when $x$ varies with $\theta$ held
constant. Figure (c) shows the velocities generated when $\theta$ varies with $x$ held constant.
Figure (d) is the velocity diagram which is formed by superposing the velocities in
diagrams (b) and (c). Note that the velocity of $P_2$ is the vector sum of the two contributions
shown.


Example 10.10 Drawing a velocity diagram 1 

The system shown in Figure 10.9 consists of two particles P\ and P 2 connected by 
a light inextensible string of length a. The particle P\ is also constrained to move 
along a fixed horizontal rail and the whole system moves in the vertical plane through 
the rail. Take the variables $x$ and $\theta$ shown as generalised coordinates and draw the
velocity diagram. 

Solution 

The construction of the velocity diagram is shown in Figure 10.9. ■ 
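The superposition rule behind the velocity diagram can be checked numerically for this system. The sketch below rests on assumptions about the figure, which is not reproduced here: $x$ is the displacement of $P_1$ along the rail, $\theta$ is the angle between the string and the downward vertical, and the y-axis points vertically upwards. The function names are hypothetical.

```python
import math

def p2_position(x, theta, a):
    """Position of P2, hanging from P1 (at (x, 0)) on a string of length a."""
    return (x + a * math.sin(theta), -a * math.cos(theta))

def p2_velocity(theta, x_dot, theta_dot, a):
    """Velocity of P2 as the vector sum of the two separate contributions."""
    from_x = (x_dot, 0.0)                                   # x varies, theta held constant
    from_theta = (a * theta_dot * math.cos(theta),          # theta varies, x held constant
                  a * theta_dot * math.sin(theta))
    return (from_x[0] + from_theta[0], from_x[1] + from_theta[1])

# Compare with a finite-difference derivative of the position.
x, theta, a = 0.4, 0.7, 2.0
x_dot, theta_dot, h = 1.5, -0.8, 1e-6
ahead = p2_position(x + x_dot * h, theta + theta_dot * h, a)
behind = p2_position(x - x_dot * h, theta - theta_dot * h, a)
numerical = ((ahead[0] - behind[0]) / (2.0 * h), (ahead[1] - behind[1]) / (2.0 * h))
exact = p2_velocity(theta, x_dot, theta_dot, a)
print(numerical, exact)
```

The contribution from $\theta$ has magnitude $a\dot\theta$ and is perpendicular to the string, as in the velocity diagram.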

Example 10.11 Drawing a velocity diagram 2 

Two rigid rods $CD$ and $DE$, of lengths $2a$ and $2b$, are flexibly jointed at $D$ and can
move freely on a horizontal table. Choose generalised coordinates and draw a velocity
diagram for this system.

Solution

Let $Oxy$ be a system of Cartesian coordinates in the plane of the table. Let $(X, Y)$ be
the Cartesian coordinates of the centre of the rod $CD$, and let $\theta$ and $\phi$ be the angles
that the two rods make with the positive $x$-axis. Then $X$, $Y$, $\theta$, $\phi$ are a set of generalised
coordinates for this system. These coordinates, and the corresponding velocity diagram,
are shown in Figure 10.10. There are four contributions to the velocity of the
centre of the rod $DE$. Also, each rod has an angular velocity. ■

We will now solve the system shown in Figure 10.9 by using conservation principles. 



FIGURE 10.10 The velocity diagram for a system with four degrees of freedom. The figure on the
left shows the system and the generalised coordinates $X$, $Y$, $\theta$, $\phi$. The figure on the right is the
completed velocity diagram.


Example 10.12 Solving an integrable system 

Consider the system shown in Figure 10.9 for the case in which $P_1$ and $P_2$ have
masses $3m$ and $m$, the rail is smooth, and the system moves under uniform gravity.
Initially, the system is released from rest with the string making an angle of $\pi/3$ with
the downward vertical. Use conservation principles to obtain two equations for the
subsequent motion.

Solution 

Let $\mathbf{i}$ be the unit vector parallel to the rail (in the direction of increasing $x$). Since
the rail is smooth, all the external forces on the system are vertical, which means that
$\mathbf{F}\cdot\mathbf{i} = 0$. This implies that $\mathbf{P}\cdot\mathbf{i}$, the horizontal component of the total linear
momentum, is conserved. From the velocity diagram, the value of $\mathbf{P}\cdot\mathbf{i}$ at time $t$ is
given by

$$\mathbf{P}\cdot\mathbf{i} = 3m\dot{x} + m\left(\dot{x} + (a\dot{\theta})\cos\theta\right) = 4m\dot{x} + ma\dot{\theta}\cos\theta.$$

Also, since the motion is started from rest, $\mathbf{P}\cdot\mathbf{i} = 0$ initially. Hence, conservation
of $\mathbf{P}\cdot\mathbf{i}$ implies that

$$4\dot{x} + a\dot{\theta}\cos\theta = 0, \tag{10.38}$$

on cancelling by $m$. This is our first equation for the subsequent motion.

Since the rail is smooth, the constraint force exerted by the rail does no work and 
the tensions in the inextensible string do no total work. Hence energy is conserved. 
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From the velocity diagram, the kinetic energy of the system at time t is given by* 

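$$T = \tfrac12(3m)\dot{x}^2 + \tfrac12 m\left(\dot{x}^2 + (a\dot{\theta})^2 + 2\dot{x}(a\dot{\theta})\cos\theta\right) = \tfrac12 m\left(4\dot{x}^2 + a^2\dot{\theta}^2 + 2a\dot{x}\dot{\theta}\cos\theta\right).$$

The gravitational potential energy of the system at time $t$ is given by

$$V = 0 - mga\cos\theta.$$

Since the system was released from rest with $\theta = 60°$, the initial value of $T$ is zero,
while the initial value of $V$ is $-\tfrac12 mga$. Hence, conservation of energy implies that

$$\tfrac12 m\left(4\dot{x}^2 + a^2\dot{\theta}^2 + 2a\dot{x}\dot{\theta}\cos\theta\right) - mga\cos\theta = -\tfrac12 mga,$$

which simplifies to give

$$4\dot{x}^2 + a^2\dot{\theta}^2 + 2a\dot{x}\dot{\theta}\cos\theta = ga\,(2\cos\theta - 1). \tag{10.39}$$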

This is our second equation for the subsequent motion. 

Since this system has two degrees of freedom and satisfies two conservation principles,
it must be integrable. Hence, the conservation equations (10.38), (10.39) must
be soluble in the sense described above. ■


Question Equation for $\theta$

Deduce an equation satisfied by $\theta$ alone and find the speeds of $P_1$ and $P_2$ when the
string becomes vertical.


Answer

From the linear momentum equation (10.38),

$$\dot{x} = -\tfrac14 a\dot{\theta}\cos\theta$$

and, if we now eliminate $\dot{x}$ from the energy equation (10.39), we obtain, after simplification,

$$\dot{\theta}^2 = \frac{4g}{a}\left(\frac{2\cos\theta - 1}{4 - \cos^2\theta}\right), \tag{10.40}$$

which is an equation for $\theta$ alone.

It follows from this equation that, when the string becomes vertical (that is, when
$\theta = 0$), $\dot{\theta}^2 = 4g/3a$. Hence, at this instant, $\dot{\theta} = -(4g/3a)^{1/2}$ and (from the momentum
conservation equation) $\dot{x} = +(ag/12)^{1/2}$. Hence, the speed of $P_1$ is $(ag/12)^{1/2}$
and the speed of $P_2$ is $|\dot{x} + a\dot{\theta}| = (3ag/4)^{1/2}$. ■
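
The two conservation equations and the result (10.40) can be checked numerically. The following is only a minimal sketch: the function names and the values of $g$ and $a$ are illustrative assumptions, not part of the text. It substitutes $\dot{x}$ from (10.38) and $\dot{\theta}^2$ from (10.40) into the energy equation (10.39) and evaluates the two speeds when the string is vertical.

```python
import math

def theta_dot_squared(theta, g, a):
    """Right-hand side of (10.40): the squared angular velocity of the string."""
    return (4.0 * g / a) * (2.0 * math.cos(theta) - 1.0) / (4.0 - math.cos(theta) ** 2)

def x_dot(theta, theta_dot, a):
    """Velocity of P1 along the rail, from the momentum equation (10.38)."""
    return -0.25 * a * theta_dot * math.cos(theta)

def energy_residual(theta, g, a):
    """Left side minus right side of the energy equation (10.39)."""
    td = -math.sqrt(theta_dot_squared(theta, g, a))  # theta decreases after release
    xd = x_dot(theta, td, a)
    lhs = 4.0 * xd**2 + (a * td) ** 2 + 2.0 * a * xd * td * math.cos(theta)
    rhs = g * a * (2.0 * math.cos(theta) - 1.0)
    return lhs - rhs

g, a = 9.8, 0.5  # illustrative values only

# Speeds at the instant the string is vertical (theta = 0).
td0 = -math.sqrt(theta_dot_squared(0.0, g, a))
speed_p1 = abs(x_dot(0.0, td0, a))
speed_p2 = abs(x_dot(0.0, td0, a) + a * td0)

print(energy_residual(0.7, g, a), speed_p1, speed_p2)
```

The residual should vanish to rounding error for any $\theta$ between the release angle and the vertical, and the two speeds can be compared with $(ag/12)^{1/2}$ and $(3ag/4)^{1/2}$.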


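* Suppose a velocity $\mathbf{V}$ is the sum of two contributions, $\mathbf{v}_1$ and $\mathbf{v}_2$, so that $\mathbf{V} = \mathbf{v}_1 + \mathbf{v}_2$. Then

$$|\mathbf{V}|^2 = \mathbf{V}\cdot\mathbf{V} = (\mathbf{v}_1 + \mathbf{v}_2)\cdot(\mathbf{v}_1 + \mathbf{v}_2) = \mathbf{v}_1\cdot\mathbf{v}_1 + \mathbf{v}_2\cdot\mathbf{v}_2 + 2\mathbf{v}_1\cdot\mathbf{v}_2 = |\mathbf{v}_1|^2 + |\mathbf{v}_2|^2 + 2\mathbf{v}_1\cdot\mathbf{v}_2.$$
This formula was used to find the kinetic energy of particle $P_2$.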
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nothing about this orientation and so the total force on the body is not known and the motion 
of G cannot be determined. 

There are, however, some important exceptions:

• Consider a rigid body moving without rotation. In this case the motion of $G$ determines
the motion of every particle of the body. Then the total external force on the
body is known and the motion of $G$ can be determined. This, in turn, determines
the motion of the whole body. For example, the problem of a block sliding without
rotation on a table can be completely solved by particle mechanics.

• Consider any system moving solely under uniform gravity. In this case, the total
external force on the system is a known constant and the motion of $G$ can be determined.
This does not, however, determine the motion of the individual particles. For
example, if the system were a brick thrown through the air, then particle mechanics
can calculate exactly where its centre of mass will go, but not which particle of the
brick will hit the ground first.


In the general case, however, we must use approximations. For example, suppose that the
particles of the system move in the force fields $\mathbf{F}_i(\mathbf{r})$ so that the total force on the system is

$$\sum_{i=1}^{N} \mathbf{F}_i(\mathbf{r}_i).$$

In general, this is not equal to $\sum_i \mathbf{F}_i(\mathbf{R})$. We can however approximate $\sum_i \mathbf{F}_i(\mathbf{r}_i)$ by $\sum_i \mathbf{F}_i(\mathbf{R})$,
in which case we are assuming that the ratio

$$\frac{|\mathbf{F}_i(\mathbf{r}_i) - \mathbf{F}_i(\mathbf{R})|}{|\mathbf{F}_i(\mathbf{R})|} \ll 1 \tag{10.41}$$

for all $i$. In the following argument we investigate when this condition can be expected to
hold.


Let $\boldsymbol{\delta}_i$ be the position vector of the particle $P_i$ of the system relative to $G$. Then

$$\mathbf{F}_i(\mathbf{R} + \boldsymbol{\delta}_i) - \mathbf{F}_i(\mathbf{R}) = \frac{d\mathbf{F}_i}{ds}\,|\boldsymbol{\delta}_i| + O\!\left(|\boldsymbol{\delta}_i|^2\right),$$

where $d\mathbf{F}_i/ds$ means the directional derivative of $\mathbf{F}_i$ in the direction of the displacement $\boldsymbol{\delta}_i$. The condition
(10.41) therefore requires that

$$\frac{|d\mathbf{F}_i/ds|\,|\boldsymbol{\delta}_i|}{|\mathbf{F}_i(\mathbf{R})|} \ll 1$$

for all $i$ and for all values of $\mathbf{R}$ that are attained in the motion of the system. This will hold if

$$A \ll \frac{|\mathbf{F}_i(\mathbf{R})|}{\max |d\mathbf{F}_i/ds|} \tag{10.42}$$

for all $i$ and for all points on the path of the centre of mass. Here ‘max’ means the maximum over all
directions, and $A$ is the ‘radius’ of the system (the maximum distance of any particle of the system from the
centre of mass). Thus the radius of the system is required to be small compared with the quantities above,
not the lateral extent of the motion.
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Although the condition (10.42) looks formidable, its physical meaning is quite simple: the 
radius of the system is required to be small compared with a length scale over which any of 
the force fields vary significantly. 

Consider for example a body moving under the gravitational attraction of a mass $M_0$ which
is fixed at the origin $O$. In this field, the particle $P_i$ of the system moves under the force field

$$\mathbf{F}_i(\mathbf{r}) = -\frac{m_i M_0 G}{r^2}\,\hat{\mathbf{r}}.$$

For this field, the right side of the condition (10.42) evaluates to give $R/2$, where $R$ ($= |\mathbf{R}|$) is
the distance of the centre of mass of the body from $O$. Therefore the total gravitational force
on the body will be accurately approximated by the force

$$\mathbf{F}(\mathbf{R}) = -\frac{M M_0 G}{R^2}\,\hat{\mathbf{R}}$$

(where $M$ is the total mass of the body), if $A \ll R$ at each point on the path of the centre
of mass. This means that the radius of the body must be small compared with its distance of 
closest approach to the centre O. This condition has no direct connection with the ‘extent of 
the motion’. Indeed, on a hyperbolic orbit, the path is infinite, but the condition (10.42) will 
not hold if the path passes too close to the centre of force. Similar remarks apply to motion of a 
body in any central field governed by a power law. If however the field were that corresponding 
to the Yukawa potential 



where $k$, $a$ are positive constants, then $A$ is required to be small compared with the length
scale a as well as the distance of closest approach to O. 
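
The size of the ratio in (10.41) for an inverse-square field can be estimated numerically. The sketch below is illustrative only: the function names, the unit force constant and the sample displacements are assumptions made for the example. It compares the force at a displaced point with the force at the centre of mass, for one radial and one transverse displacement.

```python
import math

def inverse_square_force(r, k=1.0):
    """Attractive inverse-square force -k r_hat / |r|^2 at position r (a 3-tuple)."""
    d = math.sqrt(sum(c * c for c in r))
    return tuple(-k * c / d**3 for c in r)

def relative_error(R, delta):
    """|F(R + delta) - F(R)| / |F(R)|, the ratio appearing in condition (10.41)."""
    F0 = inverse_square_force(R)
    F1 = inverse_square_force(tuple(x + y for x, y in zip(R, delta)))
    diff = math.sqrt(sum((p - q) ** 2 for p, q in zip(F1, F0)))
    return diff / math.sqrt(sum(c * c for c in F0))

R = (10.0, 0.0, 0.0)                               # centre of mass at distance 10 from O
radial = relative_error(R, (0.01, 0.0, 0.0))       # displacement along R
transverse = relative_error(R, (0.0, 0.01, 0.0))   # displacement perpendicular to R

print(radial, transverse)
```

For a displacement of size 0.01 at distance 10, the radial ratio comes out close to $2 \times 0.01/10$ and the transverse ratio close to $0.01/10$. The radial direction is therefore the worst case, which is consistent with the value $R/2$ quoted for the right side of (10.42).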


Problems on Chapter 10 

Answers and comments are at the end of the book. 

Harder problems carry a star (*). 

Linear momentum principle & centre of mass equation 

10.1 Show that, if a system moves from one state of rest to another over a certain time interval, 
then the average of the total external force over this time interval must be zero. 

An hourglass of mass $M$ stands on a fixed platform which also measures the apparent
weight of the hourglass. The sand is at rest in the upper chamber when, at time $t = 0$, a
tiny disturbance causes the sand to start running through. The sand comes to rest in the lower
chamber after a time $t = \tau$. Find the time average of the apparent weight of the hourglass
over the time interval $[0, \tau]$. [The apparent weight of the hourglass is however not constant
in time. One can advance an argument that, when the sand is steadily running through, the
apparent weight of the hourglass exceeds the real weight!]

10.2 Show that, if a system moves periodically, then the average of the total external force 
over a period of the motion must be zero. 




where $\gamma = M/m$. [Assume that the thrust is such that the rocket takes off immediately.]

10.8 Saturn V rocket In the first stage of the Saturn V rocket, the initial mass was $2.8 \times 10^6$ kg, of
which $2.1 \times 10^6$ kg was fuel. The fuel was burned at a constant rate over 150 s and the exhaust
speed was 2600 m s$^{-1}$. Use the results of the last problem to find the speed and height of the
Saturn V at first stage burnout. [Take $g$ to be constant at 9.8 m s$^{-2}$ and neglect air resistance.]

10.9 Rocket in resisting medium A rocket of initial mass $M$, of which $M - m$ is fuel, burns
its fuel at a constant rate $k$ and ejects the exhaust gases with constant speed $w$. The rocket
starts from rest and moves through a medium that exerts the resistance force $-\epsilon k v$, where $v$ is
the forward velocity of the rocket, and $\epsilon$ is a small positive constant. Gravity is absent. Find
the maximum speed $V$ achieved by the rocket. Deduce a two term approximation for $V$, valid
when $\epsilon$ is small.

10.10 Two-stage rocket A two-stage rocket has a first stage of initial mass $M_1$, of which
$(1 - \eta)M_1$ is fuel, a second stage of initial mass $M_2$, of which $(1 - \eta)M_2$ is fuel, and an inert
payload of mass $m_0$. In each stage, the exhaust gases are ejected with the same speed $u$. The
rocket is initially at rest in free space. The first stage is fired and, on completion, the first
stage carcass (of mass $\eta M_1$) is discarded. The second stage is then fired. Find an expression
for the final speed $V$ of the rocket and deduce that $V$ will be maximised when the mass ratio
$\alpha = M_2/(M_1 + M_2)$ satisfies the equation

$$\alpha^2 + 2\beta\alpha - \beta = 0,$$

where $\beta = m_0/(M_1 + M_2)$. [Messy algebra.]

Show that, when $\beta$ is small, the optimum value of $\alpha$ is approximately $\beta^{1/2}$ and the
maximum velocity reached is approximately $2u \ln \gamma$, where $\gamma = 1/\eta$.

10.11* A raindrop falls vertically through stationary mist, collecting mass as it falls. The 
raindrop remains spherical and the rate of mass accretion is proportional to its speed and the 
square of its radius. Show that, if the drop starts from rest with a negligible radius, then it has 
constant acceleration $g/7$. [Tricky ODE.]

Collisions 

10.12 A body of mass 4m is at rest when it explodes into three fragments of masses 2m, m 
and m. After the explosion the two fragments of mass m are observed to be moving with the 
same speed in directions making 120° with each other. Find the proportion of the total kinetic 
energy carried by each fragment. 

10.13 Show that, in an elastic head-on collision between two spheres, the relative velocity of 
the spheres after impact is the negative of the relative velocity before impact. 

A tube is fixed in the vertical position with its lower end on a horizontal floor. A ball of 
mass M is released from rest at the top of the tube followed closely by a second ball of mass 
m. The first ball bounces off the floor and immediately collides with the second ball coming 
down. Assuming that both collisions are elastic, show that, when m/M is small, the second 
ball will be projected upwards to a height nearly nine times the length of the tube. 
10.14 Two particles with masses $m_1$, $m_2$ and velocities $\mathbf{v}_1$, $\mathbf{v}_2$ collide and stick together. Find
the velocity of this composite particle and show that the loss in kinetic energy due to the
collision is

$$\frac{m_1 m_2}{2(m_1 + m_2)}\,|\mathbf{v}_1 - \mathbf{v}_2|^2.$$


10.15 In an elastic collision between a proton moving with speed u and a helium nucleus at 
rest, the proton was scattered through an angle of 45°. What proportion of its initial energy 
did it lose? What was the recoil angle of the helium nucleus? 


10.16 In an elastic collision between an alpha particle and an unknown nucleus at rest, the 
alpha particle was deflected through a right angle and lost 40% of its energy. Identify the 
mystery nucleus. 


10.17 Some inequalities in elastic collisions Use the elastic scattering formulae to show the
following inequalities:

(i) When $m_1 > m_2$, the scattering angle $\theta_1$ is restricted to the range $0 \le \theta_1 \le \sin^{-1}(m_2/m_1)$.

(ii) If $m_1 < m_2$, the opening angle is obtuse, while, if $m_1 > m_2$, the opening angle is acute.

(iii)

$$\frac{E_2}{E_0} \le \frac{4 m_1 m_2}{(m_1 + m_2)^2}.$$

10.18 Equal masses Show that, when the particles are of equal mass, the elastic scattering
formulae take the simple form

$$\theta_1 = \tfrac12\psi, \qquad \theta_2 = \tfrac12\pi - \tfrac12\psi, \qquad \theta = \tfrac12\pi,$$

where $\psi$ is the scattering angle in the ZM frame.

In the scattering of neutrons of energy E by neutrons at rest, in what directions should the 
experimenter look to find neutrons of energy |£? What other energies would be observed in 
these directions? 


$$\frac{E_1}{E_0} = \cos^2\tfrac12\psi, \qquad \frac{E_2}{E_0} = \sin^2\tfrac12\psi.$$


10.19 Use the elastic scattering formulae to express the energy of the scattered particle as a
function of the scattering angle, and the energy of the recoiling particle as a function of the
recoil angle, as follows:

$$\frac{E_1}{E_0} = \frac{1 + \gamma^2\cos 2\theta_1 + 2\gamma\cos\theta_1\left(1 - \gamma^2\sin^2\theta_1\right)^{1/2}}{(\gamma + 1)^2}, \qquad \frac{E_2}{E_0} = \frac{4\gamma\cos^2\theta_2}{(\gamma + 1)^2}.$$

Make polar plots of $E_1/E_0$ as a function of $\theta_1$ for the case of neutrons scattered by the nuclei
of hydrogen, deuterium, helium and carbon.

Two-body problem and two-body scattering

10.20 Binary star The observed period of the binary star Cygnus X-1 (of which only one
component is visible) is 5.6 days, and the semi-major axis of the orbit of the visible component
is about 0.09 AU. The mass of the visible component is believed to be about $20M_\odot$. Estimate
the mass of its dark companion. [Requires the numerical solution of a cubic equation.]

10.21 In two-body elastic scattering, show that the angular distribution of the recoiling particles
is given by

$$4\cos\theta_2\,\sigma^{ZM}(\pi - 2\theta_2),$$

where $\sigma^{ZM}(\psi)$ is defined by equation (10.32).

In a Rutherford scattering experiment, alpha particles of energy $E$ were scattered by a
target of ionised helium. Find the angular distribution of the emerging particles.

10.22* Consider two-body elastic scattering in which the incident particles have energy $E_0$.
Show that the energies of the recoiling particles lie in the interval $0 \le E \le E_{\max}$, where
$E_{\max} = 4\gamma E_0/(1 + \gamma)^2$. Show further that the energies of the recoiling particles are distributed
over the interval $0 \le E \le E_{\max}$ by the frequency distribution



where $\sigma^{ZM}$ is defined by equation (10.32), and

$$\psi = 2\sin^{-1}\left(\frac{E}{E_{\max}}\right)^{1/2}.$$

In the elastic scattering of neutrons of energy $E_0$ by protons at rest, the energies of the recoiling
protons were found to be uniformly distributed over the interval $0 \le E \le E_0$, the total cross
section being $A$. Find the angular distribution of the recoiling protons and the scattering cross
section of the incident neutrons.

Integrable systems 

10.23 A particle $Q$ has mass $2m$ and two other particles $P$, $R$, each of mass $m$, are connected
to $Q$ by light inextensible strings of length $a$. The system is free to move on a smooth horizontal
table. Initially $P$, $Q$, $R$ are at the points $(0, a)$, $(0, 0)$, $(0, -a)$ respectively so that they
lie in a straight line with the strings taut. $Q$ is then projected in the positive $x$-direction with
speed $u$. Express the conservation of linear momentum and energy for this system in terms of
the coordinates $x$ (the displacement of $Q$) and $\theta$ (the angle turned by each of the strings).
Show that $\theta$ satisfies the equation



and deduce that P and R will collide after a time 
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mechanical values numerically for ka up to about 30 (the calculation becomes increasingly 
difficult as ka increases), using about 100 terms of the series. 

The author’s results are shown in Figure 10.11. The quantum mechanical value for $\sigma(\theta)$
does approach the classical value for larger scattering angles, but behaves very erratically for
small scattering angles. Also, the value of $S$ tends to twice the value expected! Your physics
lecturer will be pleased to explain these interesting anomalies.
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The angular momentum principle 

and angular momentum conservation 


KEY FEATURES 

The key features of this chapter are the angular momentum principle and conservation 
of angular momentum. Together, the linear and angular momentum principles provide the 
governing equations of rigid body motion. 


This chapter is essentially based on the angular momentum principle and its consequences.
The angular momentum principle is the last of the three great principles of multi-particle
mechanics* that apply to every mechanical system without restriction. Under
appropriate conditions, the angular momentum of a system (or one of its components) is
conserved, and we use this conservation principle to solve a variety of problems.

Together, the linear and angular momentum principles provide the governing equations
of rigid body motion; the linear momentum principle determines the translational
motion of the centre of mass, while the angular momentum principle determines the rotational
motion of the body relative to the centre of mass. In this chapter, we restrict our
attention to the special case of planar rigid body motion. Three-dimensional motion of
rigid bodies is considered in Chapter 19.

11.1 THE MOMENT OF A FORCE 

We begin with the definition of the moment of a force about a point, which is a
vector quantity. The moment of a force about an axis, a scalar quantity, is the component
along the axis of the corresponding vector moment.

Definition 11.1 Moment of a force about a point Suppose a force $\mathbf{F}$ acts on a particle
$P$ with position vector $\mathbf{r}$ relative to an origin $O$. Then $\mathbf{K}_O$, the moment of the force
$\mathbf{F}$ about the point $O$, is defined to be

$$\mathbf{K}_O = \mathbf{r} \times \mathbf{F}, \tag{11.1}$$



* The other two are the energy and linear momentum principles. 
† Also called torque, especially in the engineering literature.
FIGURE 11.1 Left: Geometrical interpretation of the vector moment $\mathbf{K}_O = \mathbf{r} \times \mathbf{F}$. Right: The
right-handed (+) and left-handed (−) senses around the ‘axis’ $\{O, \mathbf{n}\}$.

a vector quantity. If the system of particles $P_1, P_2, \ldots, P_N$, with position vectors $\mathbf{r}_1, \mathbf{r}_2,
\ldots, \mathbf{r}_N$, are acted upon by the system of forces $\mathbf{F}_1, \mathbf{F}_2, \ldots, \mathbf{F}_N$ respectively, then $\mathbf{K}_O$,
the total moment of the system of forces about $O$, is defined to be the vector sum of the
moments of the individual forces, that is,

$$\mathbf{K}_O = \sum_{i=1}^{N} \mathbf{r}_i \times \mathbf{F}_i. \tag{11.2}$$

Since any fixed point can be taken to be the origin $O$, there is no loss of generality in the
above definitions. However, there are occasions on which it is convenient to take moments
about a general point $A$ whose position vector is $\mathbf{a}$. To find $\mathbf{K}_A$, we simply replace $\mathbf{r}_i$ in
the above definitions by the position vector of $P_i$ relative to $A$, namely, $\mathbf{r}_i - \mathbf{a}$. This gives

$$\mathbf{K}_A = \sum_{i=1}^{N} (\mathbf{r}_i - \mathbf{a}) \times \mathbf{F}_i. \tag{11.3}$$

It follows that $\mathbf{K}_A$ and $\mathbf{K}_O$ are simply related by

$$\mathbf{K}_A = \mathbf{K}_O - \mathbf{a} \times \mathbf{F},$$

where $\mathbf{F}$ is the resultant force. Hence, if $\mathbf{F}$ is zero, the total moment of the forces $\{\mathbf{F}_i\}$ is
the same about every point. Such a force system is said to be a couple with moment $\mathbf{K}$.
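
The relation between $\mathbf{K}_A$ and $\mathbf{K}_O$, and the statement about couples, can be checked on a small example. The sketch below is illustrative only: the helper names and the sample positions and forces are assumptions, not data from the text.

```python
def cross(u, v):
    """Vector product of two 3-tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def total_moment(positions, forces, about=(0.0, 0.0, 0.0)):
    """Sum of (r_i - a) x F_i over the system, as in (11.3); a defaults to O."""
    total = (0.0, 0.0, 0.0)
    for r, F in zip(positions, forces):
        rel = tuple(ri - ai for ri, ai in zip(r, about))
        total = tuple(t + c for t, c in zip(total, cross(rel, F)))
    return total

positions = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (1.0, 1.0, 3.0)]
forces = [(0.0, 3.0, 0.0), (-1.0, 0.0, 2.0), (2.0, -1.0, 0.5)]
a = (0.5, -1.0, 2.0)

K_O = total_moment(positions, forces)
K_A = total_moment(positions, forces, about=a)
F_total = tuple(sum(F[i] for F in forces) for i in range(3))
shifted = tuple(k - c for k, c in zip(K_O, cross(a, F_total)))  # K_O - a x F

# A couple: equal and opposite forces, so the resultant is zero.
couple_positions = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0)]
couple_forces = [(0.0, 2.0, 0.0), (0.0, -2.0, 0.0)]
couple_O = total_moment(couple_positions, couple_forces)
couple_A = total_moment(couple_positions, couple_forces, about=a)

print(K_A, shifted, couple_O, couple_A)
```

Here `K_A` and `shifted` should agree component by component, and the moment of the couple should be the same about $O$ and about $A$.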

Geometrical interpretation of vector moment 

The formula (11.1) has a nice geometrical interpretation. Let $V$ be the plane that contains
the origin and the force $\mathbf{F}$, as shown in Figure 11.1. Let $\mathbf{n}$ be a unit vector normal to $V$,
and suppose that $\mathbf{F}$ acts in the right-handed (or positive) sense around the ‘axis’ $\{O, \mathbf{n}\}$.
(This is the case shown in Figure 11.1.) Then, from the definition (1.4) of the vector
product,

$$\mathbf{K}_O = \mathbf{r} \times \mathbf{F} = \left(|\mathbf{r}||\mathbf{F}|\sin(\tfrac12\pi + \alpha)\right)\mathbf{n} = F\,(r\cos\alpha)\,\mathbf{n} = (F \times p)\,\mathbf{n},$$

where $F$ is the magnitude of $\mathbf{F}$ and $p$ ($= OM$) is the perpendicular distance of $O$ from
the ‘line of action’ of $\mathbf{F}$. Thus, $\mathbf{K}_O$ has magnitude $F \times p$ and points in the $\mathbf{n}$-direction. If
$\mathbf{F}$ has the left-handed (or negative) sense around $\{O, \mathbf{n}\}$, then $\mathbf{K}_O = -(F \times p)\,\mathbf{n}$.
FIGURE 11.2 The moment of the force $\mathbf{F}$
about the axis $\{O, \mathbf{n}\}$ is $p \times (\mathbf{F}\cdot\hat{\boldsymbol{\phi}})$.


Motion in a plane 

Suppose we have a system of particles that lie in a plane, and the forces acting on the
particles also lie in this plane. Such a system is said to be two-dimensional. Then the
total moment $\mathbf{K}_O$ of these forces about a point $O$ of the plane is given by

$$\mathbf{K}_O = \left(\sum_{i=1}^{N} \pm F_i \times p_i\right)\mathbf{n},$$

where the plus (or minus) sign is taken when the sense of $\mathbf{F}_i$ around the axis $\{O, \mathbf{n}\}$
is right- (or left-) handed. This formula explains why, in two-dimensional mechanics,
the moment of a force can be represented by the scalar quantity $\pm F \times p$. In the
two-dimensional case, the directions of all the moments are parallel, so that they add like
scalars. However, in three-dimensional mechanics, the moments have general directions
and must be summed as vectors.

Moments about an axis 

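Definition 11.2 Moment of a force about an axis The component of the moment $\mathbf{K}_O$
in the direction of a unit vector $\mathbf{n}$ is called the moment of $\mathbf{F}$ about the axis* $\{O, \mathbf{n}\}$; it is
the scalar quantity $\mathbf{K}_O \cdot \mathbf{n}$.

This axial moment can be written (see Figure 11.2)

$$\mathbf{K}_O \cdot \mathbf{n} = (\mathbf{r} \times \mathbf{F}) \cdot \mathbf{n} = (\mathbf{n} \times \mathbf{r}) \cdot \mathbf{F} = \left((r\sin\alpha)\,\hat{\boldsymbol{\phi}}\right) \cdot \mathbf{F} = p\,(\mathbf{F} \cdot \hat{\boldsymbol{\phi}}),$$

where $p$ is the distance of $P$ from the axis $\{O, \mathbf{n}\}$ and $\phi$ is measured around the axis. The
direction of the unit vector $\hat{\boldsymbol{\phi}}$ is called the azimuthal direction around the axis $\{O, \mathbf{n}\}$.
Thus $\mathbf{F} \cdot \hat{\boldsymbol{\phi}}$ is the azimuthal component of $\mathbf{F}$.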
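
The two forms of the axial moment can be compared on a concrete case. This is only a sketch under stated assumptions: the axis is taken to be the $z$-axis through $O$ (so $\mathbf{n} = \mathbf{k}$), and the function names and sample vectors are illustrative.

```python
import math

def cross(u, v):
    """Vector product of two 3-tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    """Scalar product of two 3-tuples."""
    return sum(a * b for a, b in zip(u, v))

def axial_moment_z(r, F):
    """(r x F) . k, the moment of F about the z-axis through O."""
    return dot(cross(r, F), (0.0, 0.0, 1.0))

def azimuthal_form_z(r, F):
    """p (F . phi_hat), with p the distance of P from the z-axis."""
    p = math.hypot(r[0], r[1])
    phi_hat = (-r[1] / p, r[0] / p, 0.0)  # unit vector in the azimuthal direction
    return p * dot(F, phi_hat)

r = (3.0, 4.0, 7.0)
F = (1.0, -2.0, 5.0)
print(axial_moment_z(r, F), azimuthal_form_z(r, F))
```

Both expressions give the same number. The components of $\mathbf{r}$ and $\mathbf{F}$ along the axis do not contribute, which is why only $p$ and the azimuthal component of $\mathbf{F}$ appear.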


* This ‘axis’ is merely a directed line in space. It does not necessarily correspond to the rotation of any
rigid body.
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Definition 11.3 Angular momentum about a point Suppose a particle P of mass m 
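has position vector $\mathbf{r}$ and velocity $\mathbf{v}$. Then $\mathbf{L}_O$, the angular momentum of $P$ about $O$, is
defined to be

$$\mathbf{L}_O = \mathbf{r} \times (m\mathbf{v}), \tag{11.4}$$

a vector quantity. If the system of particles $P_1, P_2, \ldots, P_N$, with masses $m_1, m_2, \ldots, m_N$,
have position vectors $\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_N$ and velocities $\mathbf{v}_1, \mathbf{v}_2, \ldots, \mathbf{v}_N$ respectively, then $\mathbf{L}_O$,
the angular momentum of the system about $O$, is defined to be the vector sum of the
angular momenta of the individual particles, that is,

$$\mathbf{L}_O = \sum_{i=1}^{N} \mathbf{r}_i \times (m_i\mathbf{v}_i). \tag{11.5}$$

The corresponding formula for angular momentum about a general point $A$ is therefore

$$\mathbf{L}_A = \sum_{i=1}^{N} (\mathbf{r}_i - \mathbf{a}) \times (m_i\mathbf{v}_i),$$

from which it follows that $\mathbf{L}_A$ and $\mathbf{L}_O$ are simply related by

$$\mathbf{L}_A = \mathbf{L}_O - \mathbf{a} \times \mathbf{P},$$

where $\mathbf{P}$ is the total linear momentum of the system.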

The geometrical interpretation of the angular momentum of a particle is similar to that
of moment of a force (see Figure 11.1). Let $V$ be the plane that contains $O$, $P$ and the
velocity $\mathbf{v}$, and let $\mathbf{n}$ be a unit vector normal to $V$. Then

$$\mathbf{L}_O = \pm(mv \times p)\,\mathbf{n},$$

where $v$ is the magnitude of $\mathbf{v}$ and $p$ is the perpendicular distance of $O$ from the line
through $P$ parallel to $\mathbf{v}$. The $\pm$ sign is decided by the sense of $\mathbf{v}$ around the axis $\{O, \mathbf{n}\}$,
as shown in Figure 11.1.

Example 11.3 Calculating the angular momentum of a particle 

The position of a particle $P$ of mass $m$ at time $t$ is given by $x = a\theta^2$, $y = 2a\theta$,
$z = 0$, where $\theta = \theta(t)$. Find the angular momentum of $P$ about the point $B(a, 0, 0)$
at time $t$.

Solution

The position vector of the particle relative to $B$ at time $t$ is

$$\mathbf{r} - \mathbf{b} = (a\theta^2\,\mathbf{i} + 2a\theta\,\mathbf{j}) - a\,\mathbf{i} = a\left[(\theta^2 - 1)\,\mathbf{i} + 2\theta\,\mathbf{j}\right]$$
FIGURE 11.3 The particle $P$ slides on the
inside surface of the axially symmetric bowl
$z = f(p)$.


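and the velocity of the particle at time $t$ is

$$\mathbf{v} = \frac{d\mathbf{r}}{dt} = 2a\theta\dot{\theta}\,\mathbf{i} + 2a\dot{\theta}\,\mathbf{j} = 2a\dot{\theta}\,(\theta\,\mathbf{i} + \mathbf{j}).$$

The angular momentum of the particle about $B$ at time $t$ is therefore

$$\mathbf{L}_B = (\mathbf{r} - \mathbf{b}) \times (m\mathbf{v}) = 2ma^2\dot{\theta}\left[(\theta^2 - 1)\,\mathbf{i} + 2\theta\,\mathbf{j}\right] \times \left[\theta\,\mathbf{i} + \mathbf{j}\right] = -2ma^2(\theta^2 + 1)\,\dot{\theta}\,\mathbf{k}.$$
■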
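
The result of Example 11.3 can be checked by evaluating the vector product directly. The sketch below is illustrative only: the function names and the sample values of $\theta$, $\dot{\theta}$, $m$ and $a$ are assumptions chosen for the check.

```python
def cross(u, v):
    """Vector product of two 3-tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def angular_momentum_about_B(theta, theta_dot, m=1.0, a=1.0):
    """(r - b) x (m v) for x = a*theta**2, y = 2*a*theta, z = 0 and B = (a, 0, 0)."""
    rel = (a * theta**2 - a, 2.0 * a * theta, 0.0)                 # r - b
    v = (2.0 * a * theta * theta_dot, 2.0 * a * theta_dot, 0.0)    # dr/dt
    return cross(rel, tuple(m * c for c in v))

def closed_form_z(theta, theta_dot, m=1.0, a=1.0):
    """k-component quoted in the example: -2 m a^2 (theta^2 + 1) theta_dot."""
    return -2.0 * m * a**2 * (theta**2 + 1.0) * theta_dot

L_B = angular_momentum_about_B(0.7, 1.3, m=2.0, a=0.5)
print(L_B, closed_form_z(0.7, 1.3, m=2.0, a=0.5))
```

The first two components of `L_B` vanish because the motion is confined to the plane $z = 0$, and the third should match the closed form.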


Angular momentum about an axis 

Definition 11.4 Angular momentum about an axis The component of the angular
momentum $\mathbf{L}_O$ in the direction of a unit vector $\mathbf{n}$ is called the angular momentum of
$P$ about the axis $\{O, \mathbf{n}\}$; it is the scalar quantity $\mathbf{L}_O \cdot \mathbf{n}$.

By the same argument as was used for moments about an axis, the angular momentum of
a particle of mass $m$ and velocity $\mathbf{v}$ about the axis $\{O, \mathbf{n}\}$ can be written in the form

$$\mathbf{L}_O \cdot \mathbf{n} = mp\,(\mathbf{v} \cdot \hat{\boldsymbol{\phi}}), \tag{11.6}$$

where $p$ is the perpendicular distance of the particle from the axis and $\mathbf{v} \cdot \hat{\boldsymbol{\phi}}$ is the azimuthal
component of $\mathbf{v}$ around the axis.

Example 11.4 Particle sliding inside a bowl 

A particle $P$ of mass $m$ slides on the inside surface of an axially symmetric bowl.
Find its angular momentum about its axis of symmetry in terms of the coordinates $p$,
$\phi$ shown in Figure 11.3.


Solution 


In order to express $\mathbf{L}_O \cdot \mathbf{k}$ in terms of coordinates, we draw a velocity diagram for
the system as explained in section 10.10. The velocities $\mathbf{v}_p$ and $\mathbf{v}_\phi$, corresponding to
the coordinates $p$ and $\phi$, have the directions shown in Figure 11.3. These two velocities
are perpendicular, with the contribution in the azimuthal direction around the
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FIGURE 11.4 The rigid body B rotates about 
the fixed axis $\{O, \mathbf{n}\}$ with angular velocity $\omega$.

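vertical axis $\{O, \mathbf{k}\}$. It follows that $\mathbf{v} \cdot \hat{\boldsymbol{\phi}} = v_\phi = p\dot{\phi}$. The required axial angular
momentum is therefore

$$\mathbf{L}_O \cdot \mathbf{k} = mp\,(\mathbf{v} \cdot \hat{\boldsymbol{\phi}}) = mp\,(p\dot{\phi}) = mp^2\dot{\phi}.$$

Just for the record, the velocity $\mathbf{v}_p$ is the (vector) sum of $\dot{p}$ radially outwards and
$\dot{z}$ vertically upwards. Note that $\dot{z}$ is not an independent quantity. If the equation of
the bowl is $z = f(p)$, then $\dot{z} = f'(p)\,\dot{p}$. In particular then, the kinetic energy of $P$ is
given by

$$T = \tfrac12 m\left[\dot{p}^2 + \left(f'(p)\,\dot{p}\right)^2 + (p\dot{\phi})^2\right],$$

and its potential energy by $V = mgf(p)$. ■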
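
The expressions for the axial angular momentum and the kinetic energy can be cross-checked against a direct Cartesian calculation. The sketch below is illustrative only: the bowl shape $z = p^2$, the function names and the sample state are hypothetical choices, not taken from the example.

```python
import math

def bowl_quantities(p, phi, p_dot, phi_dot, m, g, f, f_prime):
    """Return (L_z, T, V) for a particle on the bowl z = f(p), computed from
    Cartesian position and velocity rather than from the closed-form results."""
    x, y, z = p * math.cos(phi), p * math.sin(phi), f(p)
    vx = p_dot * math.cos(phi) - p * phi_dot * math.sin(phi)
    vy = p_dot * math.sin(phi) + p * phi_dot * math.cos(phi)
    vz = f_prime(p) * p_dot                  # z_dot = f'(p) p_dot
    L_z = m * (x * vy - y * vx)              # k-component of r x (m v)
    T = 0.5 * m * (vx**2 + vy**2 + vz**2)
    V = m * g * z
    return L_z, T, V

# Hypothetical paraboloidal bowl z = p^2 (an assumption made for this check).
def f(p):
    return p**2

def f_prime(p):
    return 2.0 * p

m, g = 0.3, 9.8
p, phi, p_dot, phi_dot = 0.8, 1.1, -0.4, 2.5
L_z, T, V = bowl_quantities(p, phi, p_dot, phi_dot, m, g, f, f_prime)

print(L_z, m * p**2 * phi_dot)
print(T, 0.5 * m * (p_dot**2 + (f_prime(p) * p_dot) ** 2 + (p * phi_dot) ** 2))
```

Each printed pair should agree to rounding error, whatever values of $p$, $\phi$, $\dot{p}$ and $\dot{\phi}$ are chosen.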

11.3 ANGULAR MOMENTUM OF A RIGID BODY 

The problem of finding the angular momentum of a moving rigid body in the
general three-dimensional case is tricky and is deferred until Chapter 19. In the present
chapter we essentially restrict ourselves to the case of planar rigid body motion, for which
it is sufficient to find the angular momentum of the body about its axis of rotation. This
axis may be fixed (as in the armature of a motor) or, more generally, may be the instantaneous
rotation axis through the centre of mass of the body (as in the case of a rolling
penny). In this section we consider only the case of rotation about a fixed axis; the case of
planar motion is treated in section 11.6.

Consider a rigid body $B$ rotating with angular velocity $\omega$ about the fixed axis $\{O, \mathbf{n}\}$,
as shown in Figure 11.4. Then the angular momentum of the body about this axis is

$$\mathbf{L}_O \cdot \mathbf{n} = \left(\sum_{i=1}^{N} \mathbf{r}_i \times (m_i\mathbf{v}_i)\right) \cdot \mathbf{n} = \sum_{i=1}^{N} \left(\mathbf{r}_i \times (m_i\mathbf{v}_i)\right) \cdot \mathbf{n} = \sum_{i=1}^{N} m_i p_i\,(\mathbf{v}_i \cdot \hat{\boldsymbol{\phi}}),$$

where $p_i$ is the perpendicular distance of $m_i$ from the axis, and $\mathbf{v}_i \cdot \hat{\boldsymbol{\phi}}$ is the azimuthal
component of $\mathbf{v}_i$ around the axis (see formula (11.6)). But, since the body is rigid, the
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11.4 THE ANGULAR MOMENTUM PRINCIPLE 

We now derive the fundamental result which relates the angular momentum of 
any system to the external forces that act upon it - the angular momentum principle. 

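Consider the general multi-particle system $S$ which consists of particles $P_1, P_2, \ldots,
P_N$, with masses $m_1, m_2, \ldots, m_N$ and velocities $\mathbf{v}_1, \mathbf{v}_2, \ldots, \mathbf{v}_N$, as shown in Figure 9.1.
Suppose that $S$ is acted upon by external forces $\mathbf{F}_i$ and internal forces $\mathbf{G}_{ij}$, as shown in
Figure 9.3. Then the equation of motion for the particle $P_i$ is

$$m_i\frac{d\mathbf{v}_i}{dt} = \mathbf{F}_i + \sum_{j=1}^{N} \mathbf{G}_{ij}, \tag{11.10}$$

where, as in Chapter 9, we take $\mathbf{G}_{ij} = \mathbf{0}$ when $i = j$. Then the rate of increase of the
angular momentum of the system $S$ about the origin $O$ can be written

$$\frac{d\mathbf{L}_O}{dt} = \frac{d}{dt}\left(\sum_{i=1}^{N} \mathbf{r}_i \times (m_i\mathbf{v}_i)\right) = \sum_{i=1}^{N}\left[\mathbf{r}_i \times m_i\frac{d\mathbf{v}_i}{dt} + \dot{\mathbf{r}}_i \times (m_i\mathbf{v}_i)\right] = \sum_{i=1}^{N} \mathbf{r}_i \times m_i\frac{d\mathbf{v}_i}{dt},$$

since $\dot{\mathbf{r}}_i \times (m_i\mathbf{v}_i) = m_i\,\mathbf{v}_i \times \mathbf{v}_i = \mathbf{0}$. On using the equation of motion (11.10), we obtain

$$\begin{aligned}
\frac{d\mathbf{L}_O}{dt} &= \sum_{i=1}^{N} \mathbf{r}_i \times \left(\mathbf{F}_i + \sum_{j=1}^{N} \mathbf{G}_{ij}\right) = \sum_{i=1}^{N} \mathbf{r}_i \times \mathbf{F}_i + \sum_{i=1}^{N}\sum_{j=1}^{N} \mathbf{r}_i \times \mathbf{G}_{ij} \\
&= \mathbf{K}_O + \sum_{i=2}^{N}\sum_{j=1}^{i-1}\left(\mathbf{r}_i \times \mathbf{G}_{ij} + \mathbf{r}_j \times \mathbf{G}_{ji}\right),
\end{aligned} \tag{11.11}$$

where $\mathbf{K}_O$ is the total moment about $O$ of the external forces. We have also grouped
the terms of the double sum in pairs and omitted those terms known to be zero. Now
the internal forces $\{\mathbf{G}_{ij}\}$ satisfy the Third Law, which means that $\mathbf{G}_{ij}$ must be equal and
opposite to $\mathbf{G}_{ji}$, and that $\mathbf{G}_{ij}$ must be parallel to the line $P_iP_j$. It follows that

$$\mathbf{r}_i \times \mathbf{G}_{ij} + \mathbf{r}_j \times \mathbf{G}_{ji} = \mathbf{r}_i \times \mathbf{G}_{ij} - \mathbf{r}_j \times \mathbf{G}_{ij} = (\mathbf{r}_i - \mathbf{r}_j) \times \mathbf{G}_{ij} = \mathbf{0},$$

since $\mathbf{G}_{ij}$ is parallel to the vector $\mathbf{r}_i - \mathbf{r}_j$. Thus each pair of terms of the double sum in
equation (11.11) is zero and we obtain

$$\frac{d\mathbf{L}_O}{dt} = \mathbf{K}_O,$$
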

which is the angular momentum principle. Since any fixed point can be taken to be the 
origin, this proves that: 
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FIGURE 11.5 The particle slides on the table while the string is pulled 
down through the hole. 


Conservation of angular momentum about an axis 

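Even when $\mathbf{K}_A \neq \mathbf{0}$ it is still possible for angular momentum to be conserved about a
particular axis through $A$. Let $\mathbf{n}$ be a fixed unit vector and $A$ a fixed point so that $\{A, \mathbf{n}\}$
is a fixed axis through the point $A$. Then

$$\frac{d}{dt}\left(\mathbf{L}_A \cdot \mathbf{n}\right) = \frac{d\mathbf{L}_A}{dt} \cdot \mathbf{n} + \mathbf{L}_A \cdot \frac{d\mathbf{n}}{dt} = \frac{d\mathbf{L}_A}{dt} \cdot \mathbf{n} = \mathbf{K}_A \cdot \mathbf{n}.$$

Hence, if $\mathbf{K}_A \cdot \mathbf{n} = 0$ at all times, it follows that $\mathbf{L}_A \cdot \mathbf{n}$ is conserved. This result can be
stated as follows: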


Conservation of angular momentum about an axis 

If the external forces acting on a system have no total moment about a fixed axis,
then the angular momentum of the system about that axis is conserved. The same
applies for a moving axis which passes through $G$ and maintains a constant direction.


In our first example, angular momentum conservation is sufficient to determine the 
entire motion. 

Example 11.7 Pulling a particle through a hole 

A particle $P$ of mass $m$ can slide on a smooth horizontal table. $P$ is connected to a
light inextensible string which passes through a small smooth hole $O$ in the table, so
that the lower end of the string hangs vertically below the table while $P$ moves on top
with the string taut (see Figure 11.5). Initially the lower end of the string is held fixed
with $P$ moving with speed $u$ on a circle of radius $a$. The string is now pulled down
from below in such a way that the string above the table has the length $r(t)$ at time $t$.
Find the velocity of $P$ and the tension in the string at time $t$.

Solution 

We must first establish that some component of angular momentum is conserved in 
this motion. The forces acting on P are gravity, the normal reaction of the smooth 
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table, and the tension in the string. Since the first two are equal and opposite and the 
tension force points towards O , it follows that K o = 0. Thus, however the string is 
pulled, Lo is conserved in the motion of P. 

Now we must calculate Lo- As in the case of central field orbits, Lo is perpen¬ 
dicular to the plane of motion and conservation of Lo is equivalent to conservation 
of the axial angular momentum Lo * k. Hence, as in orbital motion, 

Lo • k = mr 2 0 = L, 

where the constant L is given by the initial conditions to be L = mau. Hence, in the 
motion of P , the conservation equation 

2 * 

////- 0 = mau 

is satisfied. Since r(t) is given, this equation is sufficient to determine the motion of 
P. In particular, the velocity of P at time t is given by 

v = r r + ( rO ) 0 = r r + ^—j 0, 

from which we see that the transverse velocity of P tends to infinity as r tends to zero. 

The string tension T can be found from the radial equation of motion for P,
namely

$$m\left(\ddot{r} - r\dot{\theta}^2\right) = -T,$$

which gives

$$T = m r \dot{\theta}^2 - m\ddot{r} = \frac{m a^2 u^2}{r^3} - m\ddot{r}.$$

For example, in order to pull the string down with constant speed, the applied tension
must be

$$T = \frac{m a^2 u^2}{r^3}.$$

This tends rapidly to infinity as r tends to zero, making it impossible to pull the
particle through the hole! ■
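The growth of the tension can be checked numerically. The following is a minimal sketch, assuming the string is shortened at a constant speed c so that r(t) = a - ct; the function name `particle_state` and the numerical values are illustrative choices, not part of the example.

```python
def particle_state(m, a, u, r, r_dot, r_ddot):
    """Velocity components and string tension for the particle on the table.

    Uses conservation of axial angular momentum, m r^2 theta_dot = m a u,
    and the radial equation of motion m (r_ddot - r theta_dot^2) = -T.
    """
    theta_dot = a * u / r**2                     # from m r^2 theta_dot = m a u
    v_radial = r_dot                             # radial velocity component
    v_transverse = r * theta_dot                 # equals a u / r
    tension = m * (r * theta_dot**2 - r_ddot)    # T = m r theta_dot^2 - m r_ddot
    return v_radial, v_transverse, tension


# Hypothetical numbers: string pulled down at constant speed c, so r(t) = a - c t.
m, a, u, c = 0.1, 1.0, 2.0, 0.05
for t in (0.0, 10.0, 18.0):
    r = a - c * t
    v_r, v_theta, T = particle_state(m, a, u, r, -c, 0.0)
    print(f"t={t:5.1f}  r={r:.2f}  v_r={v_r:+.2f}  v_theta={v_theta:.2f}  T={T:.2f}")
```

With the radial acceleration set to zero the computed tension reduces to the constant-speed formula, growing like the inverse cube of r.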

Our second example belongs to a class of problems that could be called ‘before and
after problems’. We have encountered the same notion before. In elastic collision problems,
the linear momentum and energy of the system are conserved and these conservation
laws are used to relate the initial state of the system (before) to the final state (after). This
provides information about the final state that is independent of the nature of the particle
interaction. Conservation of angular momentum can be exploited in the same way.
In the following example, angular momentum conservation is sufficient to determine the
final state uniquely.

FIGURE 11.6 The beetle and the ball: the ball is smoothly pivoted about a vertical diameter and the
beetle crawls on the surface of the ball.

Example 11.8 The beetle and the ball 

A uniform ball of mass M and radius a is pivoted so that it can turn freely about one 
of its diameters which is fixed in a vertical position. A beetle of mass m can crawl 
on the surface of the ball. Initially the ball is rotating with angular speed $\Omega$ with
the beetle at the ‘North pole’ (see Figure 11.6 (left)). The beetle then walks (in any 
manner) to the ‘equator’ of the ball and sits down. What is the angular speed of the 
ball now? 

Solution 

We must first establish that some component of angular momentum is conserved. 
The external forces acting on the system of ‘beetle and ball’ are shown in Figure 11.6 
(centre). The forces X and Y are the constraint forces exerted by the pivots. The total 
moment of the external forces about O is therefore 

$$\mathbf{K}_O = \mathbf{0} \times (-Mg\mathbf{k}) + \mathbf{r} \times (-mg\mathbf{k}) + (a\mathbf{k}) \times \mathbf{X} + (-a\mathbf{k}) \times \mathbf{Y}.$$

It follows that

$$\mathbf{K}_O \cdot \mathbf{k} = 0,$$

since all the resulting triple scalar products contain two $\mathbf{k}$'s. Hence $\mathbf{L}_O \cdot \mathbf{k}$, the angular
momentum of the system about the rotation axis, is conserved, irrespective of the
wanderings of the beetle. It follows that this axial angular momentum is the same
after as it was before.

In the initial state, the angular momentum of the ball about its rotation axis is
given by $I\Omega$, where $I = 2Ma^2/5$. Initially the beetle has zero velocity so its angular
momentum is zero. Hence the initial value of the axial angular momentum is

$$\mathbf{L}_O \cdot \mathbf{k} = \tfrac{2}{5} M a^2 \Omega.$$

In the final state the ball has an unknown angular velocity $\Omega'$ and axial angular
momentum $2Ma^2\Omega'/5$. The velocity of the beetle is entirely azimuthal and is equal
to $\Omega' a$. Hence, on using the formula (11.6), the axial angular momentum of the beetle
is given by $m\rho(\mathbf{v} \cdot \hat{\boldsymbol{\phi}}) = ma(\Omega' a)$. The final value of the axial angular momentum is
therefore

$$\mathbf{L}_O \cdot \mathbf{k} = \tfrac{2}{5} M a^2 \Omega' + ma(\Omega' a) = \tfrac{1}{5}(2M + 5m)a^2 \Omega'.$$

Since $\mathbf{L}_O \cdot \mathbf{k}$ is known to be conserved it follows that

$$\tfrac{1}{5}(2M + 5m)a^2 \Omega' = \tfrac{2}{5} M a^2 \Omega,$$

and hence the final angular velocity of the ball is

$$\Omega' = \left(\frac{2M}{2M + 5m}\right)\Omega.$$

Question Change in kinetic energy 

Find the change in kinetic energy of the system caused by the beetle’s journey. 

Answer 

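The initial and final kinetic energies of the system are

$$\tfrac{1}{2}\left(\tfrac{2}{5} M a^2\right)\Omega^2 \quad\text{and}\quad \tfrac{1}{2}\left(\tfrac{2}{5} M a^2\right)\Omega'^2 + \tfrac{1}{2} m (a\Omega')^2$$

respectively. On using the value of $\Omega'$ found above and simplifying, the kinetic
energy of the system is found to decrease by

$$\frac{mMa^2\Omega^2}{2M + 5m}.$$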



Question Red hot beetle 

Does this loss of energy mean that the beetle arrives in a red-hot condition?

Answer 

Your mechanics lecturer will be pleased to answer this question. ■ 

Our last example, the spherical pendulum, is a system with two degrees of freedom. 
By using both angular momentum and energy conservation, a complete solution can be 
found. 


Example 11.9 The spherical pendulum: an integrable system 

A particle P of mass m is suspended from a fixed point O by a light inextensible
string of length a and moves with the string taut in three-dimensional space (the
spherical pendulum). Show that angular momentum about the vertical axis through
O is conserved and express this conservation law in terms of the generalised coordinates
$\theta$, $\phi$, as shown in Figure 11.7. Obtain also the corresponding equation for
conservation of energy.

FIGURE 11.7 The spherical pendulum with generalised coordinates $\theta$ and $\phi$. Left: the external
forces. Right: the velocity diagram.


Initially the string makes an acute angle $\alpha$ with the downward vertical and the
particle is projected with speed u in a horizontal direction at right angles to the string.
Determine the constants of the motion, and deduce an equation satisfied by $\theta(t)$ in
the subsequent motion.

Solution 

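The external forces on the particle are gravity and the tension in the string (see Figure
11.7 (left)). Hence,

$$\mathbf{K}_O = \mathbf{r} \times (-mg\mathbf{k}) + \mathbf{r} \times \mathbf{T} = -mg\,\mathbf{r} \times \mathbf{k},$$

the second term being zero since $\mathbf{r}$ and $\mathbf{T}$ are parallel. It follows that

$$\mathbf{K}_O \cdot \mathbf{k} = -mg(\mathbf{r} \times \mathbf{k}) \cdot \mathbf{k} = 0,$$

since the triple scalar product has two $\mathbf{k}$'s. Hence $\mathbf{L}_O \cdot \mathbf{k}$ is conserved.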

In order to express this conservation law in terms of coordinates, we draw a velocity
diagram for the system as explained in section 10.10. The velocities corresponding
to the coordinates $\theta$ and $\phi$ are $a\dot{\theta}$ and $\rho\dot{\phi}$ $(= (a\sin\theta)\dot{\phi})$ respectively in the
directions shown in Figure 11.7 (right). These two velocities are perpendicular, with the
$(a\sin\theta)\dot{\phi}$ contribution in the azimuthal direction around the vertical axis {O, k}. It
follows that $\mathbf{v} \cdot \hat{\boldsymbol{\phi}} = (a\sin\theta)\dot{\phi}$. The required axial angular momentum is therefore

$$\mathbf{L}_O \cdot \mathbf{k} = m\rho(\mathbf{v} \cdot \hat{\boldsymbol{\phi}}) = m(a\sin\theta)(a\sin\theta\,\dot{\phi}) = ma^2 \sin^2\theta\,\dot{\phi}$$

and conservation of $\mathbf{L}_O \cdot \mathbf{k}$ is expressed by

$$ma^2 \sin^2\theta\,\dot{\phi} = L,$$

where the axial angular momentum L is a constant of the motion.

FIGURE 11.8 The calculated path of the spherical pendulum for the case $\alpha = \pi/6$ and $u^2/ag = 1.9$.
Left: After four oscillations of $\theta$. Right: After ten oscillations of $\theta$. The surrounding boxes show the
perspective.


Answer 

Despite the problem being called integrable, the integrals arising from the separation
procedure cannot be evaluated and no explicit solution is possible. However, equation
(11.19) has the form of an energy equation for a system with one degree of freedom.
We have met this situation before with the radial motion equation in orbit theory and
the deductions we can make are the same. Because the left side of (11.19) is positive,
it follows that the motion is restricted to those values of $\theta$ that make the function

$$F(\theta) = (\cos\alpha - \cos\theta)\left(\frac{\cos\alpha + \cos\theta}{\sin^2\theta} - \frac{2ag}{u^2}\right)$$

positive. Moreover, maximum and minimum values of $\theta$ can only occur when
$F(\theta) = 0$.

Since $F(\alpha) = 0$, $\theta = \alpha$ is one extremum* and any other extremum must be a
root of the equation $G(\theta) = 0$, where

$$G(\theta) = \frac{\cos\alpha + \cos\theta}{\sin^2\theta} - \frac{2ag}{u^2}.$$

Whether $\alpha$ is a maximum or minimum point of $\theta$ depends on the value of the initial
projection speed u. On differentiating equation (11.19) with respect to t, we find that
the initial value of $\ddot{\theta}$ is given by

$$\ddot{\theta}\,\Big|_{\theta=\alpha} = \frac{u^2 \sin\alpha}{a^2}\left(\frac{\cos\alpha}{\sin^2\alpha} - \frac{ag}{u^2}\right),$$

so that $\theta$ initially increases if $u^2/ag > \sin^2\alpha/\cos\alpha$, and $\theta$ initially decreases if
$u^2/ag < \sin^2\alpha/\cos\alpha$. (The critical case corresponds to the special case of conical
motion.) Suppose that the first condition holds. Then $\theta_{\min} = \alpha$ and $\theta_{\max}$ must be a
root of the equation $G(\theta) = 0$. Since $G(\alpha) > 0$ and $G(\pi - \alpha) < 0$, such a root does
exist and is less than $\pi - \alpha$.

* This is because of the form of the initial conditions.

For example, consider the particular case in which $\alpha = \pi/3$ and $u^2 = 4ag$. Then
the equation $G(\theta) = 0$ simplifies to give

$$\cos\theta\,(\cos\theta + 2) = 0,$$

from which it follows that $\theta_{\max} = \pi/2$. Hence, in this case, $\theta$ oscillates periodically
in the range $\pi/3 \le \theta \le \pi/2$.
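For general initial conditions the root of $G(\theta) = 0$ has to be found numerically. The sketch below does this by bisection on the interval $(\alpha, \pi - \alpha)$, using the sign change of G noted in the text; the function names and the dimensionless parameter `u2_over_ag` (standing for $u^2/ag$) are illustrative choices, and the method assumes the first condition ($G(\alpha) > 0$) holds.

```python
import math


def G(theta, alpha, u2_over_ag):
    """G(theta) = (cos(alpha) + cos(theta)) / sin^2(theta) - 2 a g / u^2."""
    return (math.cos(alpha) + math.cos(theta)) / math.sin(theta) ** 2 - 2.0 / u2_over_ag


def theta_max(alpha, u2_over_ag, tol=1e-12):
    """Bisection for the root of G between alpha and pi - alpha.

    Assumes G(alpha) > 0, i.e. u^2/(a g) > sin^2(alpha)/cos(alpha), so that
    theta initially increases; G(pi - alpha) is always negative.
    """
    lo, hi = alpha, math.pi - alpha
    if G(lo, alpha, u2_over_ag) <= 0.0:
        raise ValueError("theta does not initially increase for these conditions")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if G(mid, alpha, u2_over_ag) > 0.0:
            lo = mid            # root lies above mid
        else:
            hi = mid            # root lies at or below mid
    return 0.5 * (lo + hi)


# The particular case of the text: alpha = pi/3 and u^2 = 4 a g.
print(theta_max(math.pi / 3, 4.0), math.pi / 2)
```

For the particular case in the text the bisection should reproduce the exact value $\pi/2$ to within the tolerance.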

At the same time as the coordinate $\theta$ oscillates, the coordinate $\phi$ increases in
accordance with equation (11.18). Hence, during each oscillation period $\tau$ of the
coordinate $\theta$, $\phi$ increases by

$$\Phi = \frac{u \sin\alpha}{a} \int_0^{\tau} \frac{dt}{\sin^2\theta}.$$

This pattern of motion repeats itself with period $\tau$, but the motion is only truly periodic
if it eventually links up with itself; this occurs only when the initial conditions
are such that $\Phi/\pi$ is a rational number.

Figure 11.8 shows an actual path of the spherical pendulum, corresponding to the
initial conditions $\alpha = \pi/6$ and $u^2/ag = 1.9$. The results are entirely consistent with
the theory above. ■

11.6 PLANAR RIGID BODY MOTION 
What is planar motion? 

Planar motion is a generalisation of two-dimensional motion in which two-dimensional
methods are still valid. The complications of the full three-dimensional theory (represented
by the two vector equations (11.15)) melt away to leave three scalar equations,
which often have a very simple form. This enables a variety of fascinating problems to be
solved in simple closed form. Planar rigid body motion is good value for money!

A system is said to be in planar motion if each of its particles moves in a plane 
and all of these planes are parallel to a fixed plane V called the plane of motion. For 
example, any purely translational motion of a rigid body in which G moves in the plane 
V is a planar motion, as is any purely rotational motion about an axis through G that is 
perpendicular to V. The same is true when both of these motions are present together. For 
example, a cylinder (of any cross-section) rolling down a rough inclined plane is in planar 
motion. It is not necessary for the bodies that make up our system to be cylinders, nor 
even to be bodies of revolution. We merely suppose that each constituent of the system 
should have reflective symmetry in the plane of motion,* as shown in Figure 11.9. 


* The reason for this symmetry restriction is that, if a body of completely general shape (a potato, say) were
started in planar motion and moved under realistic forces (uniform gravity, say), then the motion would

FIGURE 11.10 The pulley rotates about the fixed horizontal axis {O, j} and the
suspended particle moves vertically.


A special case arises when the body is rotating about a fixed axis like the elliptical
crank shown in Figure 11.9. Although the equations (11.21) could still be used, this is not
the quickest way. Let the fixed axis be {O, j}, where O lies in the plane of motion (see
Figure 11.9). If the angular momentum principle is now applied about O (instead of G),
then the unknown reactions exerted by the pivots make no contribution to $K_O$ and do not
appear in the third equation of (11.21). The first two equations in (11.21) serve only to
determine these reactions, once the motion has been calculated. Hence, unless the pivot
reactions are actually required, it is sufficient to use the single equation:

Rigid body equation - fixed axis

$$I_O \frac{d\omega}{dt} = K_O \qquad (11.22)$$

In this case also, the moment of inertia $I_O$ is constant.

Example 11.10 Planar motion: mass hanging from a pulley 

A circular pulley of mass M and radius a is smoothly pivoted about the axis {O, j},
as shown in Figure 11.10. A light inextensible string is wrapped round the pulley 
so that it does not slip, and a particle of mass m is suspended from the free end. 
The system undergoes planar motion with the particle moving vertically. Find the 
downward acceleration of the particle. 

Solution 

This problem is most easily solved by using energy conservation, but it is instructive
to solve it as a planar motion problem to illustrate the difference between the
two approaches. In the energy conservation approach, the whole system is considered
to be a single entity, and in this case, the string tensions do no total work and need
not be considered. In the setting of planar motion however, the system consists of
two elements, (i) a particle moving in a vertical straight line, and (ii) a rigid pulley
rotating about a fixed horizontal axis. For each constituent, the tension force exerted
by the string is external. The string tensions (which are equal in this problem) therefore
appear in the equations of motion. For this reason, the conservation method is
simpler, but there are many problems where there is no useful conservation principle
and the planar motion approach is essential.

Let the particle have downward vertical velocity v, and the pulley have angular
velocity $\omega$ (in the sense shown) at time t. Since the vector j points into the page,
this is the positive sense around the axis {O, j}. Thus, in the notation used here, the
positive sense for angular velocity is clockwise. The same applies to moments and
angular momenta.

First consider the motion of the particle. Since the motion is in a vertical straight
line, the only surviving equation is

$$m\frac{dv}{dt} = mg - T,$$

where T is the string tension at time t.

Now consider the motion of the pulley. Since this is rotating about a fixed axis,
the equation of motion is $I_O\,d\omega/dt = K_O$, that is,

$$I_O \frac{d\omega}{dt} = aT,$$

since the weight force Mg and the pivot reaction Z have zero moment about O.
Hence, the unknown tension T can be eliminated to give

$$ma\frac{dv}{dt} + I_O\frac{d\omega}{dt} = mga.$$

This equation applies whether or not the string slips on the pulley. However, since
we are given that the string does not slip, v and $\omega$ must be related by the no-slip
condition $v = \omega a$. On using this condition in the last equation, we obtain

$$\frac{dv}{dt} = \left(\frac{ma^2}{ma^2 + I_O}\right) g.$$

This is the downward acceleration of the particle. If the moment of inertia of the
pulley is $\tfrac{1}{2}Ma^2$, then the value of this acceleration is $[2m/(2m + M)]g$. ■
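As a check on the elimination, the sketch below evaluates the acceleration and the tension and confirms that both equations of motion are satisfied; the function names, the default value of g and the numerical values are illustrative assumptions.

```python
def particle_acceleration(m, a, I_O, g=9.81):
    """Downward acceleration dv/dt = m a^2 g / (m a^2 + I_O) under no slip."""
    return m * a**2 * g / (m * a**2 + I_O)


def string_tension(m, a, I_O, g=9.81):
    """Tension from the particle equation m dv/dt = m g - T."""
    return m * (g - particle_acceleration(m, a, I_O, g))


# Illustrative values: a uniform disk pulley, for which I_O = M a^2 / 2.
M, m, a, g = 4.0, 1.0, 0.5, 9.81
I_O = 0.5 * M * a**2
acc = particle_acceleration(m, a, I_O, g)
T = string_tension(m, a, I_O, g)

print(acc, 2 * m / (2 * m + M) * g)      # closed form for the disk pulley
print(I_O * (acc / a), a * T)            # pulley equation: I_O d(omega)/dt = a T
```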

Our next example, the cotton reel problem, is a famous problem in planar mechanics. 
The mathematics is elementary, but the solution needs to be interpreted carefully. 

Example 11.11 The cotton reel problem 

A cotton reel is at rest on a rough horizontal table when the free end of the thread is 
pulled horizontally with a constant force T , as shown in Figure 11.11. Given that the 
reel undergoes planar motion,* how does it move? 


* In practice, it is impossible to maintain planar motion in the problem as described (try it). However, the
problem is the same if the thread is replaced by a broad flat tape for which planar motion is easier to
achieve.

FIGURE 11.11 The reel is initially at rest on a rough horizontal table,
when the free end of the thread is pulled with a constant force.


Solution 

Suppose the ends of the reel have radius a, the axle wound with thread has radius
b (with b < a), and the whole reel together with its thread has mass M. (We will
neglect the mass of any thread pulled from the reel.) Let the reel have horizontal
velocity V and angular velocity $\omega$ in the directions shown at time t. Note that we are
not presuming the variables V, $\omega$ (or F) must take positive values. The signs of these
variables will be deduced in the course of solving the problem.

The external forces on the reel are the string tension T and the friction force F at
the table. (The weight force and the normal reaction at the table cancel.) Hence, the
equations of motion for the reel are

$$M\frac{dV}{dt} = T - F, \qquad (11.23)$$

$$Mk^2\frac{d\omega}{dt} = aF - bT, \qquad (11.24)$$

where $Mk^2$ is the moment of inertia of the reel about {G, j}. We are not making any
prior assumption about whether the reel slides or rolls. We will simply presume that
the friction force F is bounded in magnitude by some maximum $F^{\max}$, that is,

$$-F^{\max} \le F \le F^{\max}, \qquad (11.25)$$

and that $F = +F^{\max}$ (or $-F^{\max}$) when the reel is sliding forwards (or backwards).

Whether the reel slides or rolls depends on $v_C$, the velocity of the contact particle
C. Since $v_C = V - \omega a$, it follows by manipulating the equations (11.23), (11.24) that
$v_C$ satisfies the equation

$$\left(\frac{Mk^2}{k^2 + ab}\right)\frac{dv_C}{dt} = T - \gamma F, \qquad (11.26)$$

where the constant $\gamma$ is given by

$$\gamma = \frac{k^2 + a^2}{k^2 + ab}. \qquad (11.27)$$

Different cases arise depending on how hard one pulls on the thread.

Strong pull $T > \gamma F^{\max}$

In this case, the right side of equation (11.26) is certain to be positive so that
$dv_C/dt > 0$ for all t. Since the system starts from rest, $v_C = 0$ initially and so
$v_C > 0$ for all t > 0. In other words, the reel slides forwards. This in turn implies
that $F = F^{\max}$ so that the equations of motion (11.23), (11.24) become

$$\frac{dV}{dt} = \frac{T - F^{\max}}{M}, \qquad \frac{d\omega}{dt} = \frac{aF^{\max} - bT}{Mk^2}.$$

These equations imply that the reel slides forwards with constant acceleration and
constant angular acceleration. Note that $\omega$ is positive for $\gamma F^{\max} < T < (a/b)F^{\max}$
and negative for $T > (a/b)F^{\max}$.

Gentle pull $T < \gamma F^{\max}$

In this case, the reel must roll. The proof of this is by contradiction, as follows.

Suppose that the reel were to slide forwards at any time in the subsequent motion. Then
there must be a time $\tau$, at which $v_C$ and $dv_C/dt$ are both positive. The condition $v_C > 0$ implies
that $F = F^{\max}$ when $t = \tau$, and the condition $dv_C/dt > 0$ then implies that $T > \gamma F^{\max}$ when
$t = \tau$. This is contrary to assumption and so forward sliding can never take place. A similar
argument excludes backward sliding and so the only possibility is that the reel must roll.

The reel must therefore satisfy the rolling condition $V = \omega a$ and this,
together with the equations of motion (11.23), (11.24), implies that the reel must
roll forwards with constant acceleration

$$\frac{dV}{dt} = \frac{a(a - b)T}{M(k^2 + a^2)}. \qquad ■$$
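The two cases can be collected into a short routine. The following is a sketch only: the function name `cotton_reel` is illustrative, and the borderline case $T = \gamma F^{\max}$, which the text does not treat separately, is assigned here to rolling (the two sets of formulae agree there).

```python
def cotton_reel(M, k, a, b, T, F_max):
    """Classify the motion of the reel and return its accelerations.

    Returns (mode, dV_dt, domega_dt), where mode is "slides" or "rolls".
    """
    gamma = (k**2 + a**2) / (k**2 + a * b)        # equation (11.27)
    if T > gamma * F_max:
        # Strong pull: forward sliding with F = F_max in (11.23), (11.24).
        dV_dt = (T - F_max) / M
        domega_dt = (a * F_max - b * T) / (M * k**2)
        return "slides", dV_dt, domega_dt
    # Gentle pull (borderline case included by assumption): rolling, V = omega a.
    dV_dt = a * (a - b) * T / (M * (k**2 + a**2))
    return "rolls", dV_dt, dV_dt / a


# Illustrative values, for which gamma = 5/3 and gamma * F_max = 5.
print(cotton_reel(M=1.0, k=1.0, a=2.0, b=1.0, T=2.0, F_max=3.0))   # gentle pull
print(cotton_reel(M=1.0, k=1.0, a=2.0, b=1.0, T=8.0, F_max=3.0))   # strong pull
```

In the second call the pull exceeds $(a/b)F^{\max}$, so the angular acceleration returned is negative, in line with the remark on the sign of $\omega$.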

Example 11.12 A circus trick 

In a circus trick, a performer of mass m causes a large ball of mass M and radius a to
accelerate to the right (see Figure 11.12) by running to the left on the upper surface of
the ball. The man does not fall off the ball because he maintains this motion in such
a way that the angle $\alpha$ shown remains constant. Find the conditions necessary for
such a motion to take place.

Solution 

Suppose the motion is planar and that, at time t, the ball has velocity V in the
i-direction and angular velocity $\omega$ (= V/a) around the axis {O, j}. If the man is to
maintain his position on the ball, then he must run up the surface of the ball (towards
the highest point) with velocity V. This maintains his vertical height and his acceleration
is then the same as that of the ball, namely $(dV/dt)\mathbf{i}$.

The equations of motion for the man are therefore

$$m\frac{dV}{dt} = N_2 \sin\alpha - F_2 \cos\alpha,$$

$$0 = N_2 \cos\alpha + F_2 \sin\alpha - mg,$$

and the equations of motion for the ball are

$$M\frac{dV}{dt} = F_1 - N_2 \sin\alpha + F_2 \cos\alpha,$$

$$0 = N_1 - N_2 \cos\alpha - F_2 \sin\alpha - Mg,$$

$$I_O\frac{d\omega}{dt} = aF_2 - aF_1,$$

where $I_O$ is the moment of inertia of the ball about {O, j}.

These five equations, together with the rolling condition $V = \omega a$, are sufficient to
determine the six unknowns $dV/dt$, $d\omega/dt$, $F_1$, $N_1$, $F_2$ and $N_2$. After some algebra,
the solution for the forward acceleration of the ball turns out to be

$$\frac{dV}{dt} = \frac{mg \sin\alpha}{M + (I_O/a^2) + m(1 + \cos\alpha)}.$$

Hence the motion is possible for any acute angle $\alpha$ provided that the performer can
accelerate relative to the ball with this acceleration. For the case in which the ball is
hollow, the masses of the man and the ball are equal, and $\alpha = 45°$, the acceleration
required is approximately 0.21 g. ■
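The ‘some algebra’ can be checked by substituting the solution back into the five equations of motion. The sketch below does this numerically; the function name `circus_trick` is illustrative, and the hollow ball is assumed to be a thin spherical shell with $I_O = \tfrac{2}{3}Ma^2$, a value not stated in the example but consistent with the quoted result.

```python
import math


def circus_trick(m, M, a, I_O, alpha, g=9.81):
    """Acceleration of the ball and the contact forces (F1, N1, F2, N2)."""
    s, c = math.sin(alpha), math.cos(alpha)
    A = m * g * s / (M + I_O / a**2 + m * (1.0 + c))   # dV/dt
    F2 = m * (g * s - A * c)      # from the two equations for the man
    N2 = m * (A * s + g * c)
    F1 = (M + m) * A              # horizontal equation for the ball
    N1 = (M + m) * g              # vertical equation for the ball
    return A, F1, N1, F2, N2


# Hollow ball (thin shell, assumed), equal masses, alpha = 45 degrees, g = 1.
m = M = 1.0
a, g, alpha = 1.0, 1.0, math.pi / 4
I_O = 2.0 * M * a**2 / 3.0
A, F1, N1, F2, N2 = circus_trick(m, M, a, I_O, alpha, g)

s, c = math.sin(alpha), math.cos(alpha)
residuals = [
    m * A - (N2 * s - F2 * c),              # man, horizontal
    N2 * c + F2 * s - m * g,                # man, vertical
    M * A - (F1 - N2 * s + F2 * c),         # ball, horizontal
    N1 - N2 * c - F2 * s - M * g,           # ball, vertical
    I_O * (A / a) - (a * F2 - a * F1),      # ball, rotation with omega = V/a
]
print(A, residuals)
```

Each residual should vanish to rounding error, and the acceleration printed is the value quoted above in units of g.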

11.7 RIGID BODY STATICS IN THREE DIMENSIONS 

Although we are not yet able to attempt problems in which a rigid body undergoes
three-dimensional motion, we are able to solve problems in which a rigid body is
in equilibrium under a three-dimensional system of forces. In equilibrium, the linear and
angular momentum of the body are known to be zero, so that the rigid body equations
become

Equations of rigid body statics

$$\mathbf{F} = \mathbf{0} \qquad \mathbf{K}_A = \mathbf{0} \qquad (11.28)$$

where A is any fixed point of space.* In other words, when a system is in equilibrium, the
resultant force and the resultant moment of the external forces must be zero.

FIGURE 11.13 The rectangular panel ABCD rests on the rough floor z = 0 and leans
against the smooth wall x = 0.

Since there is no motion, one may wonder what there is left to calculate in statical 
problems. However, the body is usually supported or restrained in some prescribed way, 
and it is the unknown constraint forces that are to be determined. If these constraint forces 
can be determined solely from the equilibrium equations (11.28), then the problem is said 
to be statically determinate.†

Example 11.13 Leaning panel 

A rectangular panel ABCD of mass M is (rather carelessly) placed with its edge AB
on the rough horizontal floor z = 0 and with the vertex D resting against the smooth
wall x = 0, as shown in Figure 11.13. The four vertices of the panel are at the points
A(2, 0, 0), B(6, 4, 0), C(4, 6, 6) and D(0, 2, 6) respectively. Given that the panel
does not slip on the floor, find the reaction force exerted by the wall.

Solution 

The external forces acting on the panel are the normal reaction of the wall $N\mathbf{i}$, the
weight force $-Mg\mathbf{k}$, and the reaction of the floor on the edge AB. Now the reaction
of the floor is distributed along the edge AB and, although we could treat it as such,
this is an irrelevant complication. We will therefore suppose that (in order to avoid
damage to the floor) the panel has been supported on two small pads beneath the
corners A and B, in which case the reaction of the floor consists of the forces X and
Y as shown.

* It is unnecessary and inconvenient to restrict moments to be taken about G.

† Not all problems are statically determinate by any means. In the two-dimensional theory, if a heavy
rigid plank is resting on three or more supports, then the individual reactions at the supports cannot be
found from the equilibrium equations. What this means is that modelling the plank as a rigid body is
not appropriate in such a problem. One should instead model the plank as a deformable body, solve the
problem using the theory of elasticity, and then pass to the rigid limit.

FIGURE 11.14 The rod AB is in equilibrium with A on a rough floor and B resting against
a smooth wall. The rod is prevented from falling by the string BC.

We now apply the equilibrium conditions. The condition $\mathbf{F} = \mathbf{0}$ yields

$$N\mathbf{i} + \mathbf{X} + \mathbf{Y} - Mg\mathbf{k} = \mathbf{0}. \qquad (11.29)$$

If we take moments about the corner A, the reaction X makes no contribution and the
condition $\mathbf{K}_A = \mathbf{0}$ becomes

$$\overrightarrow{AD} \times (N\mathbf{i}) + \overrightarrow{AB} \times \mathbf{Y} + \overrightarrow{AG} \times (-Mg\mathbf{k}) = \mathbf{0}.$$

On inserting the given numbers, this equation becomes

$$N(6\mathbf{j} - 2\mathbf{k}) + (4\mathbf{i} + 4\mathbf{j}) \times \mathbf{Y} - Mg(3\mathbf{i} - \mathbf{j}) = \mathbf{0}. \qquad (11.30)$$

The six scalar equations in (11.29), (11.30) contain the seven scalar unknowns X,
Y, N which means that the problem is not statically determinate. However, this does
not stop us from finding N, since Y can be eliminated from equation (11.30) by taking
the scalar product with the vector $4\mathbf{i} + 4\mathbf{j}$. (This is equivalent to taking moments about
the axis AB so that both X and Y disappear to leave N as the only unknown.) This
gives the reaction exerted by the wall to be $N = Mg/3$. ■
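The vector arithmetic in this example is easy to reproduce. The sketch below forms the moments about A from the given coordinates and takes the scalar product with $\overrightarrow{AB}$ to eliminate Y; the helper names are illustrative, and the panel is taken to be uniform so that G is the centroid of the four vertices.

```python
def sub(p, q):
    """Vector from q to p."""
    return tuple(pi - qi for pi, qi in zip(p, q))


def cross(u, v):
    """Vector product u x v."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    """Scalar product u . v."""
    return sum(ui * vi for ui, vi in zip(u, v))


def wall_reaction_over_Mg():
    """Return N / (M g) for the leaning panel."""
    A, B, C, D = (2, 0, 0), (6, 4, 0), (4, 6, 6), (0, 2, 6)
    G = tuple(sum(coords) / 4.0 for coords in zip(A, B, C, D))   # uniform panel
    i, k = (1, 0, 0), (0, 0, 1)
    AD, AB, AG = sub(D, A), sub(B, A), sub(G, A)
    # Moments about A:  N (AD x i) + AB x Y - M g (AG x k) = 0.
    # The scalar product with AB removes the term AB x Y.
    coeff_N = dot(cross(AD, i), AB)
    coeff_Mg = dot(cross(AG, k), AB)
    return coeff_Mg / coeff_N


print(wall_reaction_over_Mg())
```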

Example 11.14 Leaning rod 

A rough floor lies in the horizontal plane z = 0 and the plane x = 0 is occupied by a
smooth vertical wall. A uniform rod of mass M has its lower end on the floor at the
point (a, 0, 0) and its upper end rests in contact with the wall at the point (0, b, c). The
rod is prevented from falling by having its upper end connected to the point (0, 0, c)
by a light inextensible string. Given that the rod does not slip, find the tension in the
string and the reaction exerted by the wall.

Problems on Chapter 11


Answers and comments are at the end of the book. 

Harder problems carry a star (*). 

11.1 Non-standard angular momentum principle If A is a generally moving point of space
and $\mathbf{L}_A$ is the angular momentum of a system S about A in its motion relative to A, show that
the angular momentum principle for S about A takes the non-standard form



dL A 


dt 


[Begin by expanding the expression for $\mathbf{L}_A$.]

When does this formula reduce to the standard form? [This non-standard version of the 
angular momentum principle is rarely needed. However, see Problem 11.9.] 

Problems soluble by conservation principles 

11.2 A fairground target consists of a uniform circular disk of mass M and radius a that can
turn freely about a diameter which is fixed in a vertical position. Initially the target is at rest.
A bullet of mass m is moving with speed u along a horizontal straight line at right angles to
the target. The bullet embeds itself in the target at a point distance b from the rotation axis.
Find the final angular speed of the target. [The moment of inertia of the disk about its rotation
axis is $Ma^2/4$.]

Show also that the energy lost in the impact is 



11.3 A uniform circular cylinder of mass M and radius a can rotate freely about its axis of 
symmetry which is fixed in a vertical position. A light string is wound around the cylinder so 
that it does not slip and a particle of mass m is attached to the free end. Initially the system 
is at rest with the free string taut, horizontal and of length b. The particle is then projected 
horizontally with speed u at right angles to the string. The string winds itself around the 
cylinder and eventually the particle strikes the cylinder and sticks to it. Find the final angular 
speed of the cylinder. 

11.4 Rotating gas cloud A cloud of interstellar gas of total mass M can move freely in space. 
Initially the cloud has the form of a uniform sphere of radius a rotating with angular speed $\Omega$
about an axis through its centre. Later, the cloud is observed to have changed its form to that 
of a thin uniform circular disk of radius b which is rotating about an axis through its centre and 
perpendicular to its plane. Find the angular speed of the disk and the increase in the kinetic 
energy of the cloud. 

11.5 Conical pendulum with shortening string A particle is suspended from a support by a
light inextensible string which passes through a small fixed ring vertically below the support.
Initially the particle is performing a conical motion of angle 60°, with the moving part of the
string of length a. The support is now made to move slowly upwards so that the motion remains
nearly conical. Find the angle of this conical motion when the support has been raised by a
distance a/2. [Requires the numerical solution of a trigonometric equation.]

11.6 Baseball bat A baseball bat has mass M and moment of inertia $Mk^2$ about any axis
through its centre of mass G that is perpendicular to the axis of symmetry. The bat is at rest
when a ball of mass m, moving with speed u, is normally incident along a straight line through
the axis of symmetry at a distance b from G. Show that, whether the impact is elastic or not,
there is a point on the axis of symmetry of the bat that is instantaneously at rest after the impact
and that the distance c of this point from G is given by $bc = k^2$. In the elastic case, find the
speed of the ball after the impact. [Gravity (and the batter!) should be ignored throughout this
question.]

11.7 Hoop mounting a step A uniform hoop of mass M and radius a is rolling with speed 
V along level ground when it meets a step of height h (h < a). The particle C of the hoop
that makes contact with the step is suddenly brought to rest. Find the instantaneous speed of 
the centre of mass, and the instantaneous angular velocity of the hoop, immediately after the 
impact. Deduce that the particle C cannot remain at rest on the edge of the step if 



Suppose that the particle C does remain on the edge of the step. Show that the hoop will 
go on to mount the step if 



Deduce that the hoop cannot mount the step in the manner described if h > a/2.

11.8 Particle sliding on a cone A particle P slides on the smooth inner surface of a circular 
cone of semi-angle $\alpha$. The axis of symmetry of the cone is vertical with the vertex O pointing
downwards. Show that the vertical component of angular momentum about O is conserved in 
the motion. State a second dynamical quantity that is conserved. 

Initially P is a distance a from O when it is projected horizontally along the inside surface 
of the cone with speed u. Show that, in the subsequent motion, the distance r of P from O 
satisfies the equation 


,i2/.. I 



Case A For the case in which gravity is absent, find r and the azimuthal angle $\phi$ explicitly
as functions of t. Make a sketch of the path of P (as seen from ‘above’) when $\alpha = \pi/6$.

Case B For the case in which $\alpha = \pi/3$, find the value of u such that r oscillates between
a and 2a in the subsequent motion. With this value of u, show that r will first return to the
value r = a after a time



[(ξ − 1)(2 − ξ)(2 + 3ξ)]^{1/2}

11.9* Bug running on a hoop A uniform circular hoop of mass M can slide freely on a
smooth horizontal table, and a bug of mass m can run on the hoop. The system is at rest when
the bug starts to run. What is the angle turned through by the hoop when the bug has completed
one lap of the hoop? [This is a classic problem, but difficult. Apply the angular momentum
principle about the centre of the hoop, using the non-standard version given in Problem 11.1.]

Planar rigid body motion 

11.10 General rigid pendulum A rigid body of general shape has mass M and can rotate 
freely about a fixed horizontal axis. The centre of mass of the body is distance h from the 
rotation axis, and the moment of inertia of the body about the rotation axis is I. Show that the
period of small oscillations of the body about the downward equilibrium position is 



Deduce the period of small oscillations of a uniform rod of length 2a, pivoted about a horizontal
axis perpendicular to the rod and distance b from its centre.

11.11 From sliding to rolling A snooker ball is at rest on the table when it is projected 
forward with speed V and no angular velocity. Find the speed of the ball when it eventually 
begins to roll. What proportion of the original kinetic energy is lost in the process? 

11.12 Rolling or sliding? A uniform ball is released from rest on a rough plane inclined at
angle $\alpha$ to the horizontal. The coefficient of friction between the ball and the plane is $\mu$. Will
the ball roll or slide down the plane? Find the acceleration of the ball in each case.

11.13 A circular disk of mass M and radius a is smoothly pivoted about its axis of symmetry
which is fixed in a horizontal position. A bug of mass m runs with constant speed u around the
rim of the disk. Initially the disk is held at rest and is released when the bug reaches its lowest
point. What is the condition that the bug will reach the highest point of the disk?

11.14 Yo-yo with moving support A uniform circular cylinder (a yo-yo) has a light inextensible
string wrapped around it so that it does not slip. The free end of the string is fastened to
a support and the yo-yo moves in a vertical straight line with the straight part of the string also
vertical. At the same time the support is made to move vertically having upward displacement
Z(t) at time t. Find the acceleration of the yo-yo. What happens if the system starts from rest
and the support moves upwards with acceleration 2g?

11.15 Supermarket belt A circular cylinder, which is axially symmetric but not uniform, has 
mass M and moment of inertia $Mk^2$ about its axis of symmetry. The cylinder is placed on a
rough horizontal belt at right angles to the direction in which the belt can move. Initially the
cylinder and the belt are both at rest when the belt begins to move with velocity V(t). Given
that there is no slipping, find the velocity of the cylinder at time t. 

Explain why drinks bottles tend to spin on a supermarket belt (instead of moving forwards) 
if they are placed at right-angles to the belt.

FIGURE 11.15 The tension force T, the shear force S and the couple K exerted on the
upper part of the rod (black) by the lower part (grey).



11.16* Falling chimney A uniform rod of length 2a has one end on a rough table and is
balanced in the vertically upwards position. The rod is then slightly disturbed. Given that its
lower end does not slip, show that, in the subsequent motion, the angle θ that the rod makes
with the upward vertical satisfies the equation

\[ 2a\dot{\theta}^2 = 3g(1 - \cos\theta). \]

Consider now the upper part of the rod of length 2γa, as shown in Figure 11.15. Let T, S
and K be the tension force, the shear force and the couple exerted on the upper part of the rod
by the lower part. By considering the upper part of the rod to be a rigid body in planar motion,
find expressions for S and K in terms of θ.

If a tall thin chimney begins to fall, at what point along its length would you expect it to 
break first? 

Rigid body statics 

11.17 Leaning triangular panel A rough floor lies in the horizontal plane z = 0 and the 
planes x = 0, y = 0 are occupied by smooth vertical walls. A rigid uniform triangular panel 
ABC has mass m. The vertex A of the panel is placed on the floor at the point (2, 2, 0) and the 
vertices B, C rest in contact with the walls at the points (0, 1, 6), (1, 0, 6) respectively. Given
that the vertex A does not slip, find the reactions exerted by the walls. Deduce the reaction
exerted by the floor. 

11.18 Triangular coffee table A trendy Swedish coffee table has an unsymmetrical triangular
glass top supported by a leg at each vertex. Show that, whatever the shape of the triangular 
top, each leg bears one third of its weight. 

11.19 Pile of balls Three identical balls are placed in contact with each other on a horizontal 
table and a fourth identical ball is placed on top of the first three. Show that the four balls 
cannot be in equilibrium unless (i) the coefficient of friction between the balls is at least
$\sqrt{3} - \sqrt{2}$, and (ii) the coefficient of friction between each ball and the table is at
least $\tfrac{1}{4}(\sqrt{3} - \sqrt{2})$.

Chapter Twelve


Lagrange’s equations 
and conservation principles 


KEY FEATURES 

The key features of this chapter are generalised coordinates and configuration space, the 
derivation and use of Lagrange’s equations, the Lagrangian, and the connection between 
symmetry of the Lagrangian and conservation principles. 


Lagrange’s equations mark a change in direction in our development of mechanics.
Building on the work of d’Alembert, Lagrange* devised a general method for obtaining
the equations of motion for a very wide class of mechanical systems. In earlier chapters
we have used conservation principles for this purpose, but there is no guarantee that
enough conservation principles exist. In contrast, Lagrange’s method is completely general
and is not restricted to problems soluble by conservation principles. The method is
so simple to apply that it is quite possible to solve complex mechanical problems whilst
knowing very little about mechanics! However, the supporting theory has its subtleties.

Lagrange’s equations also mark the beginning of analytical mechanics in which general
principles, such as the connection between symmetry and conservation principles,
begin to take over from actual problem solving.


12.1 CONSTRAINTS AND CONSTRAINT FORCES 

A general mechanical system S consists of any number of particles $P_1$, $P_2$, ...,
$P_N$. The particles of S may have interconnections of various kinds (light strings, springs
and so on) and also be subject to external connections and constraints. These could include 
features such as a particle being forced to remain on a fixed surface or suspended from a 


* Joseph-Louis Lagrange (Giuseppe Lodovico Lagrangia), (1736-1813). Although Lagrange is often
considered to be French, he was in fact born in Turin, Italy and did not move to Paris until 1787. Lagrange
had a long career in Turin and Berlin during which time he made major contributions to mechanics, fluid
mechanics and the calculus of variations. His famous book Mécanique Analitique, published in Paris in
1788, is a definitive account of his contributions to mechanics. This work transformed mechanics into a
branch of mathematical analysis. Perhaps to emphasise this, there is not a single diagram in the whole
book!

FIGURE 12.1 The general mechanical system S consists of any number of particles $\{P_i\}$
($i = 1, \ldots, N$). The typical particle $P_i$ has mass $m_i$, position vector $\mathbf{r}_i$ and
velocity $\mathbf{v}_i$. $\mathbf{F}_i^S$ is the specified force and $\mathbf{F}_i^C$ the
constraint force acting on $P_i$.



fixed point by a light inextensible string. The pendulum, the spinning top, the bicycle and 
the solar system are examples of mechanical systems. 

Unconstrained systems 

If the particles of S are free to move anywhere in space independently of each other then 
S is said to be an unconstrained system. In this special case, the equations of motion for 
S are simply Newton’s equations for the N individual particles. Suppose that the typical
particle $P_i$ has mass $m_i$, position vector $\mathbf{r}_i$ and velocity $\mathbf{v}_i$. Then the equations of motion
for the system S are

\[ m_i \dot{\mathbf{v}}_i = \mathbf{F}_i \qquad (i = 1, \ldots, N), \]

where $\mathbf{F}_i$ is the force acting on the particle $P_i$.

Example 12.1 Two-body problem 

Write down the Newton equations for the two-body gravitation problem. 

Solution 

In this problem S consists of two particles moving solely under their mutual gravitational
attraction. There are no constraints. The motion of the system is therefore
governed by the two Newton equations

\[ m_1 \dot{\mathbf{v}}_1 = m_1 m_2 G \, \frac{\mathbf{r}_2 - \mathbf{r}_1}{|\mathbf{r}_1 - \mathbf{r}_2|^3}, \qquad m_2 \dot{\mathbf{v}}_2 = m_1 m_2 G \, \frac{\mathbf{r}_1 - \mathbf{r}_2}{|\mathbf{r}_1 - \mathbf{r}_2|^3}, \]

where G is the constant of gravitation. These equations, together with the initial
conditions, are sufficient to determine the motion of the two particles. ■
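
As an illustrative sketch only (the function name and the sample masses and positions are hypothetical, not from the text), the right-hand sides of the two Newton equations can be evaluated numerically. The forces on the two particles should come out equal and opposite.

```python
import math

G = 6.674e-11  # constant of gravitation in SI units


def gravity_on_first(m_a, m_b, r_a, r_b):
    """Force on particle a due to particle b: m_a m_b G (r_b - r_a) / |r_a - r_b|^3."""
    d = [cb - ca for ca, cb in zip(r_a, r_b)]      # r_b - r_a
    dist = math.sqrt(sum(c * c for c in d))        # |r_a - r_b|
    k = G * m_a * m_b / dist**3
    return [k * c for c in d]


# Hypothetical sample data
m1, m2 = 5.0, 3.0
r1, r2 = (0.0, 0.0, 0.0), (1.0, 2.0, 2.0)

F1 = gravity_on_first(m1, m2, r1, r2)   # right side of the first equation
F2 = gravity_on_first(m2, m1, r2, r1)   # right side of the second equation
total = [a + b for a, b in zip(F1, F2)]  # should vanish up to rounding
print(F1, F2, total)
```

Because the total force vanishes, adding the two equations shows that the total linear momentum of the pair is constant.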

Constrained systems 

Unconstrained mechanical systems are relatively rare. Indeed many of the problems
solved in earlier chapters involve mechanical systems that are subject to geometrical or
kinematical constraints. Geometrical constraints are those that involve only the position
vectors $\{\mathbf{r}_i\}$; kinematical constraints involve the $\{\mathbf{v}_i\}$ as well. Some typical constraints
are as follows:

FIGURE 12.2 The variables x and θ are a set of generalised coordinates for this system.

Definition 12.1 Generalised coordinates If the configuration of a system S is determined
by the values of a set of independent variables $q_1, \ldots, q_n$, then $\{q_1, \ldots, q_n\}$ is said
to be a set of generalised coordinates for S.

This definition deserves some explanation. 

(i) When we say the generalised coordinates must be independent variables, we mean
that there must be no functional relation connecting them. If there were, one of the
coordinates could be removed and the remaining $n - 1$ coordinates would still determine
the configuration of the system. The set of generalised coordinates must not be
reducible in this way.

(ii) When we say the generalised coordinates $q_1, \ldots, q_n$ determine the configuration of
the system S, we mean that, when the values of the coordinates $q_1, \ldots, q_n$ are given,
the position of every particle of S is determined. In other words, the position vectors
$\{\mathbf{r}_i\}$ of the particles must be known functions of the independent variables $q_1, \ldots, q_n$,
that is,

\[ \mathbf{r}_i = \mathbf{r}_i(q_1, \ldots, q_n) \qquad (i = 1, \ldots, N). \qquad (12.2) \]


Abstract though this concept may seem, generalised coordinates are remarkably easy 
to use. In practice, they are chosen to be displacements or angles that appear naturally in 
the problem. This is illustrated by the following examples. 

Example 12.2 Choosing generalised coordinates 

Let S be the system shown in Figure 12.2 which consists of two particles $P_1$ and
$P_2$ connected by a light rigid rod of length a. The particle $P_1$ is constrained to move
along a fixed horizontal rail and the system moves in the vertical plane through the
rail. Select generalised coordinates for this system and obtain expressions for the
position vectors $\mathbf{r}_1$, $\mathbf{r}_2$ in terms of these coordinates.

Solution 

Consider the variables x, θ shown. These are certainly independent variables
(they are not connected by any functional relation) and, when they are given, the
configuration of S is determined. Thus {x, θ} is a set of generalised coordinates for
the system S.

In terms of the coordinates x and θ, the positions of the particles $P_1$ and $P_2$ are
given by

\[ \mathbf{r}_1 = x\,\mathbf{i}, \qquad \mathbf{r}_2 = (x + a\sin\theta)\,\mathbf{i} - (a\cos\theta)\,\mathbf{k}, \]

which are the expressions (12.2) for this system and this choice of coordinates. ■
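
The following sketch (the function name and sample values are hypothetical) evaluates the two position vectors from the generalised coordinates x and θ and confirms that the rod constraint is satisfied automatically: the separation of the particles equals a for every choice of the coordinates.

```python
import math


def positions(x, theta, a):
    """Return (r1, r2) as (i, k) components for the rail-and-rod system."""
    r1 = (x, 0.0)
    r2 = (x + a * math.sin(theta), -a * math.cos(theta))
    return r1, r2


a = 0.7  # hypothetical rod length
for x, theta in [(0.0, 0.0), (1.5, 0.4), (-2.0, 2.5)]:
    r1, r2 = positions(x, theta, a)
    rod = math.hypot(r2[0] - r1[0], r2[1] - r1[1])
    print(x, theta, rod)  # rod should equal a in every row
```

This is the practical benefit of generalised coordinates: the constraints never have to be imposed separately because they are built into the functions $\mathbf{r}_i(q)$.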

Example 12.3 Choosing more generalised coordinates 

Choose generalised coordinates for the system consisting of three particles $P_1$, $P_2$, $P_3$
where $P_1$, $P_2$ are connected by a light rigid rod of length a and $P_2$, $P_3$ are connected
by a light rigid rod of length b. The system slides on a horizontal table. [Make a 
sketch of the system.] 

Solution 

Many choices of generalised coordinates are possible. Let Oxyz be a system of
rectangular coordinates with O on the table and Oz pointing vertically upwards. One
set of generalised coordinates consists of (i) the x and y coordinates of the particle
$P_1$, (ii) the angle θ between the line $P_1P_2$ and the x-axis, (iii) the angle φ between the
line $P_2P_3$ and the x-axis. A second set of generalised coordinates consists of (i) the
x and y coordinates of the particle $P_2$, (ii) the angle θ between the line $P_1P_2$ and the
y-axis, (iii) the angle φ between the line $P_1P_2$ and the line $P_2P_3$. ■

Degrees of freedom 

It is evident from the above example that the configuration of a system can be specified
by many different sets of generalised coordinates. However the number of coordinates
needed is always the same. In the last example, the number of generalised coordinates
needed is always four (two position coordinates and two angles).

Definition 12.2 Degrees of freedom Let S be a mechanical system subject to geometrical
constraints. Then the number of generalised coordinates needed to specify the
configuration of S is called the number of degrees of freedom of S.

The number of degrees of freedom is an important property of a mechanical system.
Suppose, for example, that we have a system with three degrees of freedom and generalised
coordinates $q_1$, $q_2$, $q_3$. Suppose also that the system is in some given configuration
when it is started into motion in some given way. This means that we know the initial
values of the coordinates $q_1$, $q_2$, $q_3$, and their time derivatives $\dot{q}_1$, $\dot{q}_2$, $\dot{q}_3$. How many equations
of motion (second order ODEs) do we need to determine the functions $q_1(t)$, $q_2(t)$,
$q_3(t)$ that describe the subsequent motion of the system? The answer is provided by the
general theory of ODEs. If the three functions $q_1(t)$, $q_2(t)$, $q_3(t)$ satisfy three (independent)
second order ODEs, then the general theory guarantees that there is precisely one
solution that satisfies the prescribed initial conditions. If there are fewer equations, the
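Example 12.8 Finding the kinetic energy

Find the particle velocities for the two-particle system shown in Figure 12.2, and
deduce the formula for the kinetic energy.

Solution

The velocities of the particles $P_1$, $P_2$ are given by

\[ \mathbf{v}_1 = \dot{x}\,\mathbf{i}, \qquad \mathbf{v}_2 = \dot{x}\,\mathbf{i} + (a\cos\theta\,\mathbf{i} + a\sin\theta\,\mathbf{k})\,\dot{\theta}. \]

The kinetic energy of the system is therefore given by

\[ \begin{aligned}
T &= \tfrac{1}{2} m\,(\mathbf{v}_1 \cdot \mathbf{v}_1) + \tfrac{1}{2} m\,(\mathbf{v}_2 \cdot \mathbf{v}_2) \\
  &= \tfrac{1}{2} m \dot{x}^2 + \tfrac{1}{2} m \left( \dot{x}^2 + (a\dot{\theta})^2 + 2\dot{x}(a\dot{\theta})\cos\theta \right) \\
  &= m\dot{x}^2 + \left(\tfrac{1}{2} m a^2\right)\dot{\theta}^2 + (ma\cos\theta)\,\dot{x}\dot{\theta}. \; ■
\end{aligned} \]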
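
The closed form for T can be cross-checked numerically. The sketch below (function names and sample values are hypothetical) compares the formula of the example with a direct evaluation of ½m|v|² for each particle, where the velocities are obtained by central differences of the position vectors of Example 12.2.

```python
import math


def kinetic_energy(m, a, theta, xdot, thetadot):
    """Closed form: m x'^2 + (1/2) m a^2 th'^2 + m a cos(th) x' th'."""
    return (m * xdot**2 + 0.5 * m * a**2 * thetadot**2
            + m * a * math.cos(theta) * xdot * thetadot)


def kinetic_energy_direct(m, a, x, theta, xdot, thetadot, dt=1e-6):
    """Sum of (1/2) m |v|^2, with velocities from central differences of positions."""
    def pos(t):
        xx, th = x + xdot * t, theta + thetadot * t
        return (xx, 0.0), (xx + a * math.sin(th), -a * math.cos(th))

    before, after = pos(-dt), pos(dt)
    total = 0.0
    for ra, rb in zip(before, after):          # loop over the two particles
        v = [(cb - ca) / (2 * dt) for ca, cb in zip(ra, rb)]
        total += 0.5 * m * sum(c * c for c in v)
    return total


T_formula = kinetic_energy(2.0, 0.5, 0.7, 1.3, -0.9)
T_direct = kinetic_energy_direct(2.0, 0.5, 0.0, 0.7, 1.3, -0.9)
print(T_formula, T_direct)
```

The two values should agree to within the truncation error of the finite difference.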

Example 12.9 General form of the kinetic energy 

Show that the kinetic energy of any holonomic mechanical system has the form

\[ T = \sum_{j=1}^{n} \sum_{k=1}^{n} a_{jk}(\mathbf{q})\,\dot{q}_j \dot{q}_k, \]

that is, a homogeneous quadratic form in the variables $\dot{q}_1, \ldots, \dot{q}_n$, with coefficients
depending on q.

Solution 

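Let P be a typical particle of S with position vector r and velocity v. Then

\[ \mathbf{v} \cdot \mathbf{v} = \left( \sum_{j=1}^{n} \frac{\partial \mathbf{r}}{\partial q_j}\,\dot{q}_j \right) \cdot \left( \sum_{k=1}^{n} \frac{\partial \mathbf{r}}{\partial q_k}\,\dot{q}_k \right) = \sum_{j=1}^{n} \sum_{k=1}^{n} \left( \frac{\partial \mathbf{r}}{\partial q_j} \cdot \frac{\partial \mathbf{r}}{\partial q_k} \right) \dot{q}_j \dot{q}_k, \]

which is a homogeneous quadratic form in the variables $\dot{q}_1, \ldots, \dot{q}_n$, with coefficients
depending on q. The kinetic energy of S is then given by

\[ T = \tfrac{1}{2} \sum_{i=1}^{N} m_i\,(\mathbf{v}_i \cdot \mathbf{v}_i) = \sum_{j=1}^{n} \sum_{k=1}^{n} a_{jk}(\mathbf{q})\,\dot{q}_j \dot{q}_k, \]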
D’Alembert’s principle

If the constraint forces on a system do no virtual work, then

\[ \sum_{i=1}^{N} m_i \dot{\mathbf{v}}_i \cdot \mathbf{v}_i^* = \sum_{i=1}^{N} \mathbf{F}_i^S \cdot \mathbf{v}_i^* \qquad (12.12) \]

where $\{\mathbf{v}_i^*\}$ is any virtual motion of the system at time t.



Differential form of d’Alembert’s principle 

D’Alembert’s principle is often quoted in the equivalent differential form

\[ \sum_i m_i \dot{\mathbf{v}}_i \cdot d\mathbf{r}_i = \sum_i \mathbf{F}_i^S \cdot d\mathbf{r}_i, \]

where the $\{d\mathbf{r}_i\}$ are any kinematically possible set of infinitesimal displacements of the particles $\{P_i\}$ at
time t. This form is also known as the principle of virtual work.

D’Alembert’s principle is not often applied directly, except in statical problems. We 
have obtained it because it leads to Lagrange’s equations. 


12.5 LAGRANGE’S EQUATIONS 

From now on, we will suppose that our mechanical system is holonomic and that
its constraint forces do no virtual work.

Definition 12.5 Standard system If a mechanical system is holonomic and its constraint
forces do no virtual work, we will call it a standard system.

Consider then a standard mechanical system with n degrees of freedom and generalised
coordinates $\mathbf{q} = (q_1, q_2, \ldots, q_n)$. Consider first the virtual motion $\{\mathbf{v}_i^*\}$ generated
by prescribing the generalised velocities at time t to be

\[ \dot{q}_1 = 1, \qquad \dot{q}_2 = \cdots = \dot{q}_n = 0. \]

From (12.7) it follows that the corresponding particle velocities are given by

\[ \mathbf{v}_i^* = \frac{\partial \mathbf{r}_i}{\partial q_1} \qquad (i = 1, \ldots, N). \]

Since we are assuming our system to be holonomic, these $\{\mathbf{v}_i^*\}$ are a kinematically possible
set of velocities.* Furthermore, since we are assuming that the constraint forces do no

* This is the only point in the derivation of Lagrange’s equations where it is essential that the system be 
holonomic. 
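it follows that

\[ \frac{\partial}{\partial \dot{q}_j}\left( \tfrac{1}{2}\mathbf{v}_i \cdot \mathbf{v}_i \right) = \mathbf{v}_i \cdot \frac{\partial \mathbf{v}_i}{\partial \dot{q}_j} = \mathbf{v}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j}. \]

The last step follows since, in the formula for $\mathbf{v}_i$, q and $\dot{q}$ are regarded as independent variables*. Then

\[ \begin{aligned}
\frac{d}{dt}\left[ \frac{\partial}{\partial \dot{q}_j}\left( \tfrac{1}{2}\mathbf{v}_i \cdot \mathbf{v}_i \right) \right]
 &= \dot{\mathbf{v}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j} + \mathbf{v}_i \cdot \frac{d}{dt}\left( \frac{\partial \mathbf{r}_i}{\partial q_j} \right) \\
 &= \dot{\mathbf{v}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j} + \mathbf{v}_i \cdot \sum_{k=1}^{n} \frac{\partial^2 \mathbf{r}_i}{\partial q_j \partial q_k}\,\dot{q}_k
\end{aligned} \]

after a further application of the chain rule. In a similar way,

\[ \frac{\partial}{\partial q_j}\left( \tfrac{1}{2}\mathbf{v}_i \cdot \mathbf{v}_i \right) = \mathbf{v}_i \cdot \frac{\partial \mathbf{v}_i}{\partial q_j} = \mathbf{v}_i \cdot \frac{\partial}{\partial q_j}\left( \sum_{k=1}^{n} \frac{\partial \mathbf{r}_i}{\partial q_k}\,\dot{q}_k \right) = \mathbf{v}_i \cdot \sum_{k=1}^{n} \frac{\partial^2 \mathbf{r}_i}{\partial q_j \partial q_k}\,\dot{q}_k, \]

where, in the formula for $\partial \mathbf{v}_i/\partial q_j$, we have regarded q and $\dot{q}$ as independent variables. Combining
these two results gives

\[ \frac{d}{dt}\left[ \frac{\partial}{\partial \dot{q}_j}\left( \tfrac{1}{2}\mathbf{v}_i \cdot \mathbf{v}_i \right) \right] - \frac{\partial}{\partial q_j}\left( \tfrac{1}{2}\mathbf{v}_i \cdot \mathbf{v}_i \right) = \dot{\mathbf{v}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j}. \]

If we now multiply by $m_i$ and sum over i we obtain

\[ \frac{d}{dt}\left( \frac{\partial T}{\partial \dot{q}_j} \right) - \frac{\partial T}{\partial q_j} = \sum_{i=1}^{N} m_i \dot{\mathbf{v}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j}, \]

which is the required result. ■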

For general specified forces $\{\mathbf{F}_i^S\}$ there is no simplification for the right sides of the
equations (12.14), but we do give them names:

Definition 12.6 Generalised force The quantity $Q_j$, defined by

\[ Q_j = \sum_{i=1}^{N} \mathbf{F}_i^S \cdot \frac{\partial \mathbf{r}_i}{\partial q_j}, \]

is called the generalised force corresponding to the coordinate $q_j$.
We have therefore proved that: 


* The formula

\[ \frac{\partial \dot{\mathbf{r}}_i}{\partial \dot{q}_j} = \frac{\partial \mathbf{r}_i}{\partial q_j} \]

is sometimes facetiously referred to as ‘cancelling the dots’. Only mathematicians find this amusing.

Lagrange’s equations for a general standard system

Let S be a standard system with generalised coordinates q, kinetic energy $T(\mathbf{q}, \dot{\mathbf{q}})$
and generalised forces $\{Q_j\}$. Then, in any motion of S, the coordinates q(t) must
satisfy the system of equations

\[ \frac{d}{dt}\left( \frac{\partial T}{\partial \dot{q}_j} \right) - \frac{\partial T}{\partial q_j} = Q_j \qquad (1 \le j \le n). \qquad (12.15) \]


This is the form of Lagrange’s equations that applies to any standard system. 


Conservative systems 

When the standard system is also conservative, the generalised forces $\{Q_j\}$ can be written
in terms of the potential energy V(q) as

\[ Q_j = -\frac{\partial V}{\partial q_j}. \qquad (12.16) \]

[This result is simply a generalisation of the formula F = −grad V.]

Proof of the formula (12.16) 

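Let $\mathbf{q}^A$, $\mathbf{q}^B$ be any two points of configuration space that can be joined by a straight line
parallel to the $q_j$-axis. Then

\[ \int_{\mathbf{q}^A}^{\mathbf{q}^B} Q_j \, dq_j = V(\mathbf{q}^A) - V(\mathbf{q}^B) = -\int_{\mathbf{q}^A}^{\mathbf{q}^B} \frac{\partial V}{\partial q_j}\, dq_j. \]

This equality holds for all $\mathbf{q}^A$, $\mathbf{q}^B$ chosen as described, which implies that the two integrands
must be equal. Hence

\[ Q_j = -\frac{\partial V}{\partial q_j}, \]

as required. ■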

We have therefore proved that: 
FIGURE 12.6 The block slides on the smooth surface of the wedge which slides
on a smooth horizontal floor.


is planar. Find Lagrange’s equations for this system and deduce (i) the acceleration 
of the wedge, and (ii) the acceleration of the block relative to the wedge. 


Solution 

This is a standard conservative system with two degrees of freedom. Take as generalised
coordinates x, the displacement of the wedge from a fixed point on the floor,
and y, the displacement of the block from a fixed point on the wedge. The calculation
of the kinetic and potential energies in terms of x, y is performed exactly as in
Chapter 9 and gives

\[ T = \tfrac{1}{2} M \dot{x}^2 + \tfrac{1}{2} m \left( \dot{x}^2 + \dot{y}^2 + 2\dot{x}\dot{y}\cos\alpha \right), \]

\[ V = -mgy\sin\alpha. \]


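The required partial derivatives of T and V are then given by

\[ \frac{\partial T}{\partial x} = 0, \qquad \frac{\partial T}{\partial \dot{x}} = (M + m)\dot{x} + (m\cos\alpha)\dot{y}, \qquad \frac{\partial V}{\partial x} = 0, \]

\[ \frac{\partial T}{\partial y} = 0, \qquad \frac{\partial T}{\partial \dot{y}} = (m\cos\alpha)\dot{x} + m\dot{y}, \qquad \frac{\partial V}{\partial y} = -mg\sin\alpha. \]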


We can now form up the Lagrange equations. The equation corresponding to
the coordinate x is

\[ \frac{d}{dt}\left[ (M + m)\dot{x} + (m\cos\alpha)\dot{y} \right] - 0 = 0, \qquad (12.18) \]

and the equation corresponding to the coordinate y is

\[ \frac{d}{dt}\left[ (m\cos\alpha)\dot{x} + m\dot{y} \right] - 0 = mg\sin\alpha. \qquad (12.19) \]


If we now perform the time derivatives in equations (12.18), (12.19) and solve
for the unknowns $\ddot{x}$, $\ddot{y}$ we obtain

\[ \ddot{x} = -\frac{mg\sin\alpha\cos\alpha}{M + m\sin^2\alpha}, \qquad \ddot{y} = \frac{(M + m)g\sin\alpha}{M + m\sin^2\alpha}, \]

which are the required accelerations. They are both constant. ■
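
As a hedged check of the algebra (the function name and the sample values of M, m and α are hypothetical), equations (12.18) and (12.19) can be treated as a 2 × 2 linear system for the two accelerations and solved by Cramer's rule.

```python
import math


def wedge_accelerations(M, m, alpha, g=9.81):
    """Solve (M+m) x'' + m cos(alpha) y'' = 0 and
    m cos(alpha) x'' + m y'' = m g sin(alpha) for (x'', y'')."""
    a11, a12, b1 = M + m, m * math.cos(alpha), 0.0
    a21, a22, b2 = m * math.cos(alpha), m, m * g * math.sin(alpha)
    det = a11 * a22 - a12 * a21          # equals m (M + m sin^2 alpha)
    xdd = (b1 * a22 - a12 * b2) / det
    ydd = (a11 * b2 - a21 * b1) / det
    return xdd, ydd


xdd, ydd = wedge_accelerations(M=3.0, m=1.0, alpha=0.6)
print(xdd, ydd)
```

The wedge acceleration comes out with the opposite sign to the acceleration of the block relative to the wedge, as expected when the block slides down and the wedge recoils.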



and the potential energy by

\[ V = -mg(b - a)\cos\theta. \]

There is only one Lagrange equation, namely

\[ \frac{d}{dt}\left[ \tfrac{3}{2} m (b - a)^2 \dot{\theta} \right] - 0 = -mg(b - a)\sin\theta, \]

which simplifies to give

\[ \ddot{\theta} + \frac{2g}{3(b - a)}\sin\theta = 0. \]

Interestingly, this equation is identical to the exact equation for the oscillations of a
simple pendulum of length 3(b − a)/2 as obtained in Chapter 6.

The linearised equation governing small oscillations of the cylinder about θ = 0
is

\[ \ddot{\theta} + \frac{2g}{3(b - a)}\,\theta = 0, \]

so that the period τ of small oscillations is given by

\[ \tau = 2\pi \left( \frac{3(b - a)}{2g} \right)^{1/2}. \]
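
A minimal numerical sketch, with hypothetical function names, radii and step size, can confirm the period formula. It integrates the exact equation $\ddot{\theta} + \frac{2g}{3(b-a)}\sin\theta = 0$ from a small initial angle with a fourth-order Runge-Kutta scheme and times the first quarter oscillation.

```python
import math


def small_oscillation_period(b, a, g=9.81):
    """Period from the linearised equation: 2 pi sqrt(3 (b - a) / (2 g))."""
    return 2 * math.pi * math.sqrt(3 * (b - a) / (2 * g))


def measured_period(b, a, theta0=0.01, g=9.81, dt=1e-4):
    """Integrate the exact equation from rest at theta0 and return
    four times the time taken to reach theta = 0."""
    k = 2 * g / (3 * (b - a))

    def acc(th):
        return -k * math.sin(th)

    th, om, t = theta0, 0.0, 0.0
    while True:
        # One RK4 step for th' = om, om' = acc(th)
        k1t, k1o = om, acc(th)
        k2t, k2o = om + 0.5 * dt * k1o, acc(th + 0.5 * dt * k1t)
        k3t, k3o = om + 0.5 * dt * k2o, acc(th + 0.5 * dt * k2t)
        k4t, k4o = om + dt * k3o, acc(th + dt * k3t)
        th_new = th + dt * (k1t + 2 * k2t + 2 * k3t + k4t) / 6
        om_new = om + dt * (k1o + 2 * k2o + 2 * k3o + k4o) / 6
        if th_new <= 0.0:
            frac = th / (th - th_new)     # linear interpolation to the zero crossing
            return 4 * (t + frac * dt)
        th, om, t = th_new, om_new, t + dt


print(small_oscillation_period(0.5, 0.2), measured_period(0.5, 0.2))
```

For a small amplitude the two values should agree closely; increasing theta0 shows the period of the exact equation growing with amplitude, just as for the simple pendulum.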


Example 12.12 Using Lagrange’s equations: III 

Let S be the system shown in Figure 12.8. The rail is smooth and the prescribed force
F(t ) acts on the particle Pi as shown. Gravity is absent. Find the Lagrange equations 
for S. 

Solution 

S is a standard system with two degrees of freedom. The new feature is the prescribed 
external force F(t) acting on Pi. This time dependent force cannot be represented by 
Answer

The equation (12.22) has some fascinating solutions, but they can only be found
numerically. First we will reduce the number of parameters by putting the equation
in dimensionless form. If we define the dimensionless time τ by τ = pt, then
the equation becomes

\[ \frac{d^2\theta}{d\tau^2} + \left( \frac{\Omega^2}{p^2} + \frac{Z_0}{a}\cos\tau \right)\sin\theta = 0. \]

We can now see that the solutions depend on the dimensionless driving frequency
p/Ω and the dimensionless driving amplitude $Z_0/a$.

One interesting question is whether the small oscillations of the pendulum about
θ = 0 are destabilised by the motion of the support. The answer is that it depends
on the dimensionless parameters p/Ω and $Z_0/a$ in a complicated way. Figure 12.10
shows results obtained by numerical solution of the equation with initial conditions
of the form $\theta = \theta_0$, $\dot{\theta} = 0$ when t = 0. The top graph shows the motion for the case
p/Ω = 1.1, $Z_0/a$ = 0.2 and $\theta_0$ = 0.1. In this graph, $\theta/\theta_0$ is plotted against τ/2π
(the number of oscillations of the support). The motion turns out to be stable with
the amplitude of the oscillations remaining close to its initial value. The bottom
graph shows the motion for the case p/Ω = 1.9, $Z_0/a$ = 0.2 and $\theta_0$ = 0.1. In this
graph, θ/2π (the number of revolutions of the pendulum) is plotted against τ/2π (the
number of oscillations of the support). This motion turns out to be unstable. The
amplitude of the oscillations grows until the pendulum performs complete circles;
it then stops and goes the opposite way. Numerical results suggest that this chaotic
motion continues indefinitely. ■
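
The numerical experiment described above can be imitated with a short script. This is only a sketch under stated assumptions: it integrates the dimensionless equation $d^2\theta/d\tau^2 + (\Omega^2/p^2 + (Z_0/a)\cos\tau)\sin\theta = 0$ with a fixed-step fourth-order Runge-Kutta scheme, and the function name, step size and number of support periods are hypothetical choices, not those used to produce the figure.

```python
import math


def max_swing(p_over_omega, z0_over_a, theta0, periods=20, dtau=0.01):
    """Return the largest |theta| reached over the given number of support periods,
    starting from theta = theta0 with zero angular velocity."""
    delta = 1.0 / p_over_omega**2        # Omega^2 / p^2

    def acc(tau, th):
        return -(delta + z0_over_a * math.cos(tau)) * math.sin(th)

    th, om, tau = theta0, 0.0, 0.0
    biggest = abs(th)
    for _ in range(int(periods * 2 * math.pi / dtau)):
        # One RK4 step for th' = om, om' = acc(tau, th)
        k1t, k1o = om, acc(tau, th)
        k2t, k2o = om + 0.5 * dtau * k1o, acc(tau + 0.5 * dtau, th + 0.5 * dtau * k1t)
        k3t, k3o = om + 0.5 * dtau * k2o, acc(tau + 0.5 * dtau, th + 0.5 * dtau * k2t)
        k4t, k4o = om + dtau * k3o, acc(tau + dtau, th + dtau * k3t)
        th += dtau * (k1t + 2 * k2t + 2 * k3t + k4t) / 6
        om += dtau * (k1o + 2 * k2o + 2 * k3o + k4o) / 6
        tau += dtau
        biggest = max(biggest, abs(th))
    return biggest


stable = max_swing(1.1, 0.2, 0.1)     # parameters of the top graph
unstable = max_swing(1.9, 0.2, 0.1)   # parameters of the bottom graph
print(stable, unstable)
```

With the first parameter set the swing should stay close to the initial amplitude, while with the second it should grow to large angles, in line with the behaviour described for the two graphs.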


12.7 THE LAGRANGIAN 

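The Lagrange equations of motion (12.17) for a conservative standard system are
expressed in terms of the kinetic energy $T = T(\mathbf{q}, \dot{\mathbf{q}})$ and the potential energy $V = V(\mathbf{q})$.
They can however be written in terms of the single function T − V. Since $\partial V/\partial \dot{q}_j = 0$,
the equations can be written

\[ \frac{d}{dt}\left( \frac{\partial T}{\partial \dot{q}_j} \right) - \frac{\partial T}{\partial q_j} = \frac{d}{dt}\left( \frac{\partial V}{\partial \dot{q}_j} \right) - \frac{\partial V}{\partial q_j}, \]

that is,

\[ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_j} \right) - \frac{\partial L}{\partial q_j} = 0 \qquad (1 \le j \le n), \]

where $L(\mathbf{q}, \dot{\mathbf{q}}) = T(\mathbf{q}, \dot{\mathbf{q}}) - V(\mathbf{q})$ is called the Lagrangian of the system. The same
operation can be applied to systems with moving constraints whose specified forces are
conservative. The only difference is that, in this case, $L = L(\mathbf{q}, \dot{\mathbf{q}}, t)$.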
Writing the Lagrange equations in this form makes no difference whatever to problem
solving. However, any system of equations that can be written in this way has special 
properties. In particular, it is equivalent to a stationary principle (see Chapter 13), and 
can also be written in Hamiltonian form (see Chapter 14). This is the form most suitable 
for advanced developments and for making the transition to quantum mechanics. There 
is therefore a strong interest in any physical system whose equations can be written in 
Lagrangian form. 

Definition 12.7 Lagrangian form If the equations of motion of a holonomic system
with generalised coordinates q can be written in the form

\[ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_j} \right) - \frac{\partial L}{\partial q_j} = 0 \qquad (1 \le j \le n), \qquad (12.23) \]

for some function $L = L(\mathbf{q}, \dot{\mathbf{q}}, t)$, then L is called the Lagrangian of the system and the
equations are said to have Lagrangian form.

For example, the Lagrangian for the driven pendulum is

\[ L(\theta, \dot{\theta}, t) = \tfrac{1}{2} m \left( a^2\dot{\theta}^2 + \dot{Z}^2 - 2a\dot{\theta}\dot{Z}\sin\theta \right) + mg(Z + a\cos\theta), \]

where Z = Z(t) is the displacement of the support point.


Velocity dependent potential 

There are systems whose specified forces are not conservative (so that V does not exist),
but their equations of motion can still be written in Lagrangian form. Any standard system
with generalised forces $\{Q_j\}$ satisfies the Lagrange equations (12.15). If it happens that
the generalised forces can be written in the form

\[ Q_j = \frac{d}{dt}\left( \frac{\partial U}{\partial \dot{q}_j} \right) - \frac{\partial U}{\partial q_j} \qquad (1 \le j \le n), \qquad (12.24) \]

for some function $U(\mathbf{q}, \dot{\mathbf{q}}, t)$, then clearly the equations (12.15) can be written in
Lagrangian form by taking

\[ L(\mathbf{q}, \dot{\mathbf{q}}, t) = T(\mathbf{q}, \dot{\mathbf{q}}, t) - U(\mathbf{q}, \dot{\mathbf{q}}, t). \]

The function $U(\mathbf{q}, \dot{\mathbf{q}}, t)$ is called the velocity dependent potential of the system.

This seems to be a mathematical artifice that has no importance in practice. It is true 
that there is only one important case in which a velocity dependent potential exists, but 
that case is very important; it is the case of a charged particle moving in electromagnetic 
fields. The following example proves this to be so for static fields. The corresponding 
result for electrodynamic fields is the subject of Problem 12.15.
Example 12.14 Charged particle in static EM fields

A particle P of mass m and charge e can move freely in the static electric field E =
E(r) and the static magnetic field B = B(r). The electric and magnetic fields exert
a force on P given by the Lorentz force formula

\[ \mathbf{F} = e\,\mathbf{E} + e\,\mathbf{v} \times \mathbf{B}, \]

where v is the velocity of P. Show that this force can be represented by a velocity
dependent potential $U(\mathbf{r}, \dot{\mathbf{r}})$ and find the Lagrangian of the system.

Solution 

In the static case, Maxwell’s equations for the electromagnetic field reduce to

\[ \operatorname{div} \mathbf{D} = \rho, \qquad \operatorname{curl} \mathbf{E} = \mathbf{0}, \qquad \operatorname{curl} \mathbf{H} = \mathbf{j}, \qquad \operatorname{div} \mathbf{B} = 0. \]

In particular, the equation curl E = 0 implies that E(r) is a conservative field and
can be written in the form

\[ \mathbf{E} = -\operatorname{grad} \phi, \]

where φ = φ(r) is the electrostatic potential. The equation div B = 0 implies that
B(r) can be written in the form

\[ \mathbf{B} = \operatorname{curl} \mathbf{A}, \]

where A = A(r) is the magnetic vector potential.*

Take the generalised coordinates to be the Cartesian coordinates x, y, z of the
particle and, from now on, let r mean (x, y, z). What we are looking for is a velocity
dependent potential $U(\mathbf{r}, \dot{\mathbf{r}})$ that yields the correct generalised forces when substituted
into the equations (12.24). In the present case the generalised forces $Q_x$, $Q_y$,
$Q_z$ are simply the x-, y- and z-components of the Lorentz force F. The electric part
of the force, eE, is easily dealt with since it is conservative and can be represented by
the ordinary potential energy V = eφ(r). It is the magnetic part of the force that
needs the velocity dependent potential. One wonders how anyone found the correct
U, but they did, and it turns out to be $-e\,\dot{\mathbf{r}} \cdot \mathbf{A}(\mathbf{r})$. All we need to do is to check that
this potential is correct.

Consider therefore the potential

\[ U = \dot{\mathbf{r}} \cdot \mathbf{A}(\mathbf{r}) = \dot{x} A_x(\mathbf{r}) + \dot{y} A_y(\mathbf{r}) + \dot{z} A_z(\mathbf{r}). \]


* The potential φ is unique to within an added constant, but, for any fixed B, there are many possibilities
for A. Adding the grad of any scalar function to A does not change the value of B. This ambiguity in
A makes no difference in the present context; any choice of A such that B = curl A will do. The actual
determination of vector potentials is described in textbooks on vector field theory (see Schey [11], for
example).
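Then

\[ \frac{\partial U}{\partial \dot{x}} = A_x, \qquad \frac{\partial U}{\partial x} = \dot{x}\frac{\partial A_x}{\partial x} + \dot{y}\frac{\partial A_y}{\partial x} + \dot{z}\frac{\partial A_z}{\partial x}, \]

and so

\[ \begin{aligned}
\frac{d}{dt}\left( \frac{\partial U}{\partial \dot{x}} \right) - \frac{\partial U}{\partial x}
 &= \frac{d}{dt}(A_x) - \dot{x}\frac{\partial A_x}{\partial x} - \dot{y}\frac{\partial A_y}{\partial x} - \dot{z}\frac{\partial A_z}{\partial x} \\
 &= \left( \dot{x}\frac{\partial A_x}{\partial x} + \dot{y}\frac{\partial A_x}{\partial y} + \dot{z}\frac{\partial A_x}{\partial z} \right) - \dot{x}\frac{\partial A_x}{\partial x} - \dot{y}\frac{\partial A_y}{\partial x} - \dot{z}\frac{\partial A_z}{\partial x} \\
 &= -\dot{y}\left( \frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y} \right) + \dot{z}\left( \frac{\partial A_x}{\partial z} - \frac{\partial A_z}{\partial x} \right) \\
 &= -\dot{y}\,(\operatorname{curl} \mathbf{A})_z + \dot{z}\,(\operatorname{curl} \mathbf{A})_y = -\dot{y} B_z + \dot{z} B_y.
\end{aligned} \]

When multiplied by −e this is $Q_x$ for the magnetic part of the force. The values of
$Q_y$ and $Q_z$ are confirmed in the same way.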
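
The x-component check can also be carried out numerically. In the sketch below the vector potential A is an arbitrary polynomial field invented purely for illustration, and all function names are hypothetical. Partial derivatives are taken by central differences, and the total time derivative of $A_x$ along the path is $\mathbf{v} \cdot \operatorname{grad} A_x$ because the field is static.

```python
def A(r):
    """Hypothetical smooth vector potential, chosen only for this check."""
    x, y, z = r
    return (y * z, x * x + z, x * y * z)


def grad(f, r, h=1e-5):
    """Central-difference gradient of the scalar function f at the point r."""
    out = []
    for i in range(3):
        rp, rm = list(r), list(r)
        rp[i] += h
        rm[i] -= h
        out.append((f(rp) - f(rm)) / (2 * h))
    return out


def magnetic_Qx(e, r, v):
    """-e [d/dt(dU/dxdot) - dU/dx] for U = rdot . A(r) with a static field A."""
    grad_Ax = grad(lambda s: A(s)[0], r)
    ddt_Ax = sum(vi * gi for vi, gi in zip(v, grad_Ax))   # dA_x/dt along the path
    dU_dx = sum(v[k] * grad(lambda s, k=k: A(s)[k], r)[0] for k in range(3))
    return -e * (ddt_Ax - dU_dx)


def lorentz_x(e, r, v):
    """x-component of e v x B, with B = curl A from central differences."""
    gA = [grad(lambda s, k=k: A(s)[k], r) for k in range(3)]  # gA[k][i] = dA_k/dx_i
    By = gA[0][2] - gA[2][0]
    Bz = gA[1][0] - gA[0][1]
    return e * (v[1] * Bz - v[2] * By)


e, r, v = 1.6, (0.3, -1.2, 0.8), (2.0, -1.0, 0.5)
print(magnetic_Qx(e, r, v), lorentz_x(e, r, v))
```

The two printed values should coincide up to rounding, whatever smooth field is substituted for A.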

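We have therefore proved that the Lorentz force is derivable from the velocity
dependent potential

\[ U = e\,\phi(\mathbf{r}) - e\,\dot{\mathbf{r}} \cdot \mathbf{A}(\mathbf{r}) \qquad (12.25) \]

and the Lagrangian of the particle is therefore

\[ L = \tfrac{1}{2} m\,\dot{\mathbf{r}} \cdot \dot{\mathbf{r}} - e\,\phi(\mathbf{r}) + e\,\dot{\mathbf{r}} \cdot \mathbf{A}(\mathbf{r}). \qquad (12.26) \]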


Question Why bother? 

Why should we find the Lagrangian for this system when we already know that the
equation of motion is

\[ m\frac{d\mathbf{v}}{dt} = e\,\mathbf{E} + e\,\mathbf{v} \times \mathbf{B}\,? \]

Answer

The interest in this Lagrangian is that, from it, one can find the Hamiltonian, and this
is what is needed to formulate the corresponding problem in quantum mechanics.
This problem has important applications to the spectra of atoms in magnetic fields. ■


12.8 THE ENERGY FUNCTION h 

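Let S be any holonomic mechanical system with Lagrangian $L(\mathbf{q}, \dot{\mathbf{q}}, t)$. Then the
equations of motion for S take the form (12.23). On multiplying the j-th equation by $\dot{q}_j$
and summing over j we obtain

\[ 0 = \sum_{j=1}^{n} \left[ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_j} \right) - \frac{\partial L}{\partial q_j} \right] \dot{q}_j = \frac{d}{dt}\left( \sum_{j=1}^{n} \frac{\partial L}{\partial \dot{q}_j}\,\dot{q}_j \right) - \left( \frac{dL}{dt} - \frac{\partial L}{\partial t} \right). \]

Note that $\partial L/\partial t$ means the partial derivative of $L(\mathbf{q}, \dot{\mathbf{q}}, t)$ with respect to its final argument,
holding q and $\dot{\mathbf{q}}$ constant. Thus

\[ \frac{dh}{dt} = -\frac{\partial L}{\partial t}, \qquad (12.27) \]

where

\[ h = \sum_{j=1}^{n} \frac{\partial L}{\partial \dot{q}_j}\,\dot{q}_j - L. \qquad (12.28) \]
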


Definition 12.8 Energy function The function h defined by equation (12.28) is called
the energy function of the system S.

The energy function h is a generalisation of the notion of energy. For conservative 
systems, we will show that it is identical with the total energy E = T + V. However, for 
non-conservative systems, V may not exist and, even if it does, h and E are not generally 
equal. There are three typical cases: 

Case A If $L = L(\mathbf{q}, \dot{\mathbf{q}}, t)$, then $\partial L/\partial t \neq 0$ and h is not conserved.

Example 12.15 h for the driven pendulum 

Find the energy function h for the driven pendulum problem. 

Solution 

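In the driven pendulum problem,

\[ L = \tfrac{1}{2} m \left( a^2\dot{\theta}^2 + \dot{Z}^2 - 2a\dot{\theta}\dot{Z}\sin\theta \right) + mg(Z + a\cos\theta), \]

and so

\[ h = \dot{\theta}\,\frac{\partial L}{\partial \dot{\theta}} - L = \tfrac{1}{2} m \left( a^2\dot{\theta}^2 - \dot{Z}^2 \right) - mg(Z + a\cos\theta). \]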
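
As a quick hedged check (function names, default parameter values and the sample state are hypothetical), the energy function can be evaluated directly from its definition $h = \dot{\theta}\,\partial L/\partial\dot{\theta} - L$, with the derivative taken by a central difference, and compared with the closed form above.

```python
import math


def lagrangian(theta, thetadot, Z, Zdot, m=1.0, a=1.0, g=9.81):
    """Driven-pendulum Lagrangian at a given state (Z, Zdot supplied as numbers)."""
    return (0.5 * m * (a**2 * thetadot**2 + Zdot**2
                       - 2 * a * thetadot * Zdot * math.sin(theta))
            + m * g * (Z + a * math.cos(theta)))


def energy_function(theta, thetadot, Z, Zdot, eps=1e-4, **kw):
    """h = thetadot * dL/dthetadot - L, derivative by central difference."""
    dL = (lagrangian(theta, thetadot + eps, Z, Zdot, **kw)
          - lagrangian(theta, thetadot - eps, Z, Zdot, **kw)) / (2 * eps)
    return thetadot * dL - lagrangian(theta, thetadot, Z, Zdot, **kw)


def h_closed_form(theta, thetadot, Z, Zdot, m=1.0, a=1.0, g=9.81):
    """Closed form: (1/2) m (a^2 th'^2 - Z'^2) - m g (Z + a cos th)."""
    return (0.5 * m * (a**2 * thetadot**2 - Zdot**2)
            - m * g * (Z + a * math.cos(theta)))


state = (0.4, 1.7, 0.3, -0.8)   # theta, thetadot, Z, Zdot
print(energy_function(*state), h_closed_form(*state))
```

Since L is quadratic in $\dot{\theta}$, the central difference is exact apart from rounding, so the two values should agree very closely.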
This is the required energy integral. In this case, the constant h is the kinetic energy
of the particle. 

This result is well known. When a charged particle moves in a magnetic field, 
the force is perpendicular to the velocity of the charge. Thus no work is done by the 
magnetic field and so the kinetic energy of the particle is conserved. For this system, 
V does not exist since the force exerted by the magnetic field is velocity dependent;
the total energy E is therefore not defined. ■ 


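Case C If S is a conservative standard system, then S is autonomous and so h is
conserved. In addition, the energy integral can be written in a more familiar form. In this
case, L = T − V, where T has the form

\[ T = \sum_{j=1}^{n} \sum_{k=1}^{n} a_{jk}(\mathbf{q})\,\dot{q}_j \dot{q}_k \]

(see Example 12.9), and V = V(q). Hence

\[ \frac{\partial L}{\partial \dot{q}_j} = \frac{\partial T}{\partial \dot{q}_j} - 0 = 2\sum_{k=1}^{n} a_{jk}(\mathbf{q})\,\dot{q}_k \]

and so

\[ \sum_{j=1}^{n} \frac{\partial L}{\partial \dot{q}_j}\,\dot{q}_j = 2\sum_{j=1}^{n} \sum_{k=1}^{n} a_{jk}(\mathbf{q})\,\dot{q}_j \dot{q}_k = 2T. \]

The energy integral therefore becomes

\[ 2T - (T - V) = \text{constant}, \]

that is

\[ T + V = \text{constant} \qquad (12.30) \]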


which is the classical form of conservation of energy. In this case, the constant is the 
total energy E of the system. 
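
The step $\sum_j (\partial T/\partial \dot{q}_j)\,\dot{q}_j = 2T$ can be illustrated with the two-particle system of Example 12.8, whose kinetic energy is $m\dot{x}^2 + \tfrac{1}{2}ma^2\dot{\theta}^2 + (ma\cos\theta)\dot{x}\dot{\theta}$. The sketch below (function names and sample values are hypothetical) stores the symmetric coefficients $a_{jk}$ for the coordinates (x, θ) and evaluates both sides.

```python
import math


def coefficient_matrix(theta, m=1.0, a=1.0):
    """Symmetric a_jk(q) for the rail-and-rod system with coordinates (x, theta)."""
    c = 0.5 * m * a * math.cos(theta)
    return [[m, c], [c, 0.5 * m * a**2]]


def quadratic_form(coeffs, qdot):
    """T = sum_j sum_k a_jk qdot_j qdot_k."""
    return sum(coeffs[j][k] * qdot[j] * qdot[k] for j in range(2) for k in range(2))


def dT_dqdot(coeffs, qdot):
    """dT/dqdot_j = 2 sum_k a_jk qdot_k (valid because a_jk is symmetric)."""
    return [2 * sum(coeffs[j][k] * qdot[k] for k in range(2)) for j in range(2)]


coeffs = coefficient_matrix(0.9, m=2.0, a=0.5)
qdot = [1.3, -0.7]
T = quadratic_form(coeffs, qdot)
euler_sum = sum(pj * qj for pj, qj in zip(dT_dqdot(coeffs, qdot), qdot))
print(T, euler_sum)   # euler_sum should equal 2 T
```

This is Euler's theorem for a homogeneous function of degree two, and it is the only property of T that the argument uses.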

12.9 GENERALISED MOMENTA 

The generalised momenta of a mechanical system are defined in a different way 
to conventional linear and angular momentum. 

Definition 12.9 Generalised momenta Consider a holonomic mechanical system
with Lagrangian $L = L(\mathbf{q}, \dot{\mathbf{q}}, t)$. Then the scalar quantity $p_j$ defined by

\[ p_j = \frac{\partial L}{\partial \dot{q}_j} \]

is called the generalised momentum corresponding to the coordinate $q_j$. It is also called
the momentum conjugate to $q_j$.

Example 12.17 Finding generalised momenta 

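Consider the problem in Example 12.10 whose Lagrangian is

\[ L = \tfrac{1}{2} M \dot{x}^2 + \tfrac{1}{2} m \left( \dot{x}^2 + \dot{y}^2 + 2\dot{x}\dot{y}\cos\alpha \right) + mgy\sin\alpha. \]

Find the generalised momenta.

Solution

With this Lagrangian, the momenta $p_x$ and $p_y$ are given by

\[ p_x = \frac{\partial L}{\partial \dot{x}} = M\dot{x} + m(\dot{x} + \dot{y}\cos\alpha), \]

\[ p_y = \frac{\partial L}{\partial \dot{y}} = m(\dot{y} + \dot{x}\cos\alpha). \; ■ \]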

Generalised momenta are often recognisable as components of linear or angular
momentum of the system. In the above example, $p_x$ is the horizontal component of the
linear momentum of S, but $p_y$ is not a component of linear momentum.

Conservation of generalised momenta 

In terms of the generalised momentum $p_j$, the j-th Lagrange equation can be written

\[ \frac{dp_j}{dt} = \frac{\partial L}{\partial q_j}. \]

It follows that if $\partial L/\partial q_j = 0$ (that is, if the coordinate $q_j$ is absent from the Lagrangian),
then the generalised momentum $p_j$ is constant in any motion. Such ‘absent’ coordinates
are said to be cyclic. We have therefore shown that:


Conservation of momentum 

If qj is a cyclic coordinate (in the sense that it does not appear in the Lagrangian), 
then pj, the generalised momentum conjugate to qj, is constant in any motion. 


In the last example, the coordinate x is cyclic but y is not. It follows that p x is 
conserved but p y is not. 
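
This can be illustrated for the wedge and block started from rest, using the constant accelerations found in Example 12.10 (the wedge accelerates in the negative x-direction). The sketch below is only illustrative: function names and sample values are hypothetical. It evaluates the momenta of Example 12.17 at several times.

```python
import math


def momenta(M, m, alpha, xdot, ydot):
    """Generalised momenta p_x and p_y of the wedge-and-block system."""
    px = M * xdot + m * (xdot + ydot * math.cos(alpha))
    py = m * (ydot + xdot * math.cos(alpha))
    return px, py


def velocities_at(t, M, m, alpha, g=9.81):
    """Generalised velocities at time t for a start from rest,
    using the constant accelerations of the wedge and block."""
    denom = M + m * math.sin(alpha)**2
    xdd = -m * g * math.sin(alpha) * math.cos(alpha) / denom
    ydd = (M + m) * g * math.sin(alpha) / denom
    return xdd * t, ydd * t


M, m, alpha = 3.0, 1.0, 0.6   # hypothetical sample values
for t in (0.0, 1.0, 2.5):
    px, py = momenta(M, m, alpha, *velocities_at(t, M, m, alpha))
    print(t, px, py)
```

The value of $p_x$ should stay at zero throughout, while $p_y$ grows at the constant rate $\partial L/\partial y = mg\sin\alpha$.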


Example 12.18 A cyclic coordinate for the spherical pendulum 


Consider the spherical pendulum shown in Figure 11.7. The Lagrangian L is given 
by 


$$L = \tfrac{1}{2} m a^2 \left[ \dot{\theta}^2 + (\sin\theta \, \dot{\phi})^2 \right] + m g a \cos\theta,$$
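Suppose now that $q(t)$ is a motion of $\mathcal{S}$ under the potential $V(q)$. Then $q$ satisfies
Lagrange’s equations which we choose to take in the form

$$\sum_{i=1}^{N} m_i \dot{\mathbf{v}}_i \cdot \frac{\partial \mathbf{r}_i}{\partial q_j} = - \frac{\partial V}{\partial q_j} \qquad (j = 1, \ldots, n).$$

On multiplying the $j$-th Lagrange equation by

$$\left[ \frac{\partial q_j^{\lambda}}{\partial \lambda} \right]_{\lambda = 0}$$

and summing over $j$ we obtain

$$\sum_{i=1}^{N} m_i \dot{\mathbf{v}}_i \cdot \left( \sum_{j=1}^{n} \frac{\partial \mathbf{r}_i}{\partial q_j} \left[ \frac{\partial q_j^{\lambda}}{\partial \lambda} \right]_{\lambda = 0} \right) = - \sum_{j=1}^{n} \frac{\partial V}{\partial q_j} \left[ \frac{\partial q_j^{\lambda}}{\partial \lambda} \right]_{\lambda = 0}.$$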


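Now

$$\sum_{j=1}^{n} \frac{\partial \mathbf{r}_i}{\partial q_j} \left[ \frac{\partial q_j^{\lambda}}{\partial \lambda} \right]_{\lambda = 0} = \left[ \frac{\partial}{\partial \lambda} \mathbf{r}_i(q^{\lambda}) \right]_{\lambda = 0} \tag{12.31}$$

by the chain rule. Furthermore

$$\frac{\partial}{\partial \lambda} \mathbf{r}_i(q^{\lambda}) = \frac{\partial \mathbf{r}_i^{\lambda}}{\partial \lambda} = \mathbf{n},$$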


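since $\mathbf{r}_i^{\lambda} = \mathbf{r}_i(q) + \lambda \mathbf{n}$ in the given displacement.
In the same way,

$$\sum_{j=1}^{n} \frac{\partial V}{\partial q_j} \left[ \frac{\partial q_j^{\lambda}}{\partial \lambda} \right]_{\lambda = 0} = \left[ \frac{\partial}{\partial \lambda} V(q^{\lambda}) \right]_{\lambda = 0} = \left[ \frac{\partial}{\partial \lambda} V(q) \right]_{\lambda = 0} = 0,$$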


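since $V$ is unchanged by the displacement, that is, $V(q^{\lambda}) = V(q)$. On combining these results
together, we obtain

$$\left( \sum_{i=1}^{N} m_i \dot{\mathbf{v}}_i \right) \cdot \mathbf{n} = 0.$$

Finally, since $\mathbf{n}$ is a constant vector, we may integrate with respect to $t$ to obtain

$$\left( \sum_{i=1}^{N} m_i \mathbf{v}_i \right) \cdot \mathbf{n} = C,$$

where $C$ is a constant. Thus the component of linear momentum in the $\mathbf{n}$-direction is conserved. ■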


For example, this theorem applies to the system shown in Figure 12.6 (the wedge and
block). This system can be translated in the $x$-direction without violating any constraints,
and this translation leaves the potential energy unchanged. The conserved quantity is the
component of linear momentum in the $x$-direction.
Theorem 12.2 Invariance of V under rotation Let $\mathcal{S}$ be a conservative standard system
with potential energy $V$. Then if $\mathcal{S}$ can be rotated (as if rigid) about the fixed axis
$\{O, \mathbf{k}\}$ without violating any constraints, and if $V$ is unchanged by this rotation, then, in
any motion of $\mathcal{S}$, the angular momentum about the axis $\{O, \mathbf{k}\}$ is conserved.

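Proof. The proof closely follows that in the last theorem. Let $\lambda$ be the angle turned in a
rotation of $\mathcal{S}$ about the fixed axis $\{O, \mathbf{k}\}$, where $O$ is also the origin of position vectors. Then
by following the same steps, we obtain, as before,

$$\sum_{i=1}^{N} m_i \dot{\mathbf{v}}_i \cdot \left[ \frac{\partial \mathbf{r}_i^{\lambda}}{\partial \lambda} \right]_{\lambda = 0} = 0.$$

This time the $\lambda$-derivative means the rate of change with respect to the rotation angle $\lambda$ so that

$$\frac{\partial \mathbf{r}_i^{\lambda}}{\partial \lambda} = \mathbf{k} \times \mathbf{r}_i^{\lambda}.$$

Since $\mathbf{r}_i^{\lambda} = \mathbf{r}_i$ when $\lambda = 0$, it follows that

$$\sum_{i=1}^{N} m_i \dot{\mathbf{v}}_i \cdot (\mathbf{k} \times \mathbf{r}_i) = \mathbf{k} \cdot \sum_{i=1}^{N} \mathbf{r}_i \times m_i \dot{\mathbf{v}}_i = 0.$$

Finally, since $\mathbf{k}$ is a constant vector, we may integrate with respect to $t$ to obtain

$$\mathbf{k} \cdot \sum_{i=1}^{N} \mathbf{r}_i \times m_i \mathbf{v}_i = C,$$

where $C$ is a constant. Thus the angular momentum about the axis $\{O, \mathbf{k}\}$ is conserved. ■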

For example, this theorem applies to the spherical pendulum. The pendulum can be
rotated about the axis $\{O, \mathbf{k}\}$ (where $O$ is the support and $\mathbf{k}$ points vertically upwards)
without violating any constraints, and this rotation leaves the potential energy unchanged.
The conserved quantity is the angular momentum of the pendulum about the vertical axis
through $O$.
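The conservation of angular momentum about the vertical axis can be observed in a numerical integration. The sketch below is an illustration only. It assumes the spherical pendulum Lagrangian $L = \tfrac12 ma^2[\dot\theta^2 + (\sin\theta\,\dot\phi)^2] + mga\cos\theta$ with $\theta$ measured from the downward vertical, integrates the resulting Lagrange equations with a classical fourth-order Runge-Kutta step, and monitors $p_\phi/(ma^2) = \sin^2\theta\,\dot\phi$. The initial state, step size and value of $g/a$ are arbitrary choices.

```python
import math

def rhs(state, g_over_a):
    """Lagrange's equations for the spherical pendulum as a first-order system.

    state = (theta, theta_dot, phi, phi_dot), theta measured from the downward vertical.
    """
    theta, theta_dot, phi, phi_dot = state
    theta_ddot = math.sin(theta) * math.cos(theta) * phi_dot**2 - g_over_a * math.sin(theta)
    phi_ddot = -2.0 * theta_dot * phi_dot * math.cos(theta) / math.sin(theta)
    return (theta_dot, theta_ddot, phi_dot, phi_ddot)

def rk4_step(state, dt, g_over_a):
    """One classical Runge-Kutta step of size dt."""
    def shifted(base, slope, factor):
        return tuple(b + factor * s for b, s in zip(base, slope))
    k1 = rhs(state, g_over_a)
    k2 = rhs(shifted(state, k1, 0.5 * dt), g_over_a)
    k3 = rhs(shifted(state, k2, 0.5 * dt), g_over_a)
    k4 = rhs(shifted(state, k3, dt), g_over_a)
    return tuple(s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def axial_momentum(state):
    """p_phi / (m a^2) = sin(theta)^2 * phi_dot."""
    theta, _, _, phi_dot = state
    return math.sin(theta) ** 2 * phi_dot

def momentum_drift(state, dt=1e-3, steps=5000, g_over_a=9.81):
    """Largest change in p_phi / (m a^2) seen during the integration."""
    start = axial_momentum(state)
    worst = 0.0
    for _ in range(steps):
        state = rk4_step(state, dt, g_over_a)
        worst = max(worst, abs(axial_momentum(state) - start))
    return worst

# Illustrative initial state: theta = 1 rad, released with phi_dot = 2 rad/s.
print(momentum_drift((1.0, 0.0, 0.0, 2.0)))
```

The drift printed is the numerical error of the integrator. The exact motion keeps $p_\phi$ constant because $\phi$ does not appear in the Lagrangian.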

These two theorems are powerful tools for identifying conserved components of linear
or angular momentum even when the system is very complex. For example, any conservative
standard system whose potential energy is invariant under all translations and
rotations conserves all three components of linear and angular momentum, as well as the
total energy, making seven conserved quantities in all.

Noether’s theorem 

The two theorems above are particular instances of an abstract result known as Noether’s
theorem.* In each of the above cases, there is a one-parameter family of mappings $\{\mathcal{M}^{\lambda}\}$,


* After the German mathematician Emmy Amalie Noether (1882-1935). Despite the obstacles placed in
the way of women academics at the time, she made fundamental contributions to pure mathematics in 
the areas of invariance theory and abstract algebra. The result now known as Noether’s theorem was 
published in 1918. 




parametrised by a real variable $\lambda$, that act on the configuration space $Q$, that is,

$$q \xrightarrow{\;\mathcal{M}^{\lambda}\;} q^{\lambda}. \tag{12.32}$$

In each case $\lambda = 0$ corresponds to the identity mapping (that is, $q \to q$), and in each case
the potential energy $V(q)$ is invariant under $\{\mathcal{M}^{\lambda}\}$, that is,

$$V(q^{\lambda}) = V(q).$$

From these facts, we were able to prove that, in each case, a certain momentum component
was a constant of the motion.

This idea was generalised by Noether to apply to any Lagrangian system and any
family of mappings $\{\mathcal{M}^{\lambda}\}$, provided that the Lagrangian $L$ is invariant under $\{\mathcal{M}^{\lambda}\}$ in
the sense that

$$L(q^{\lambda}, \dot{q}^{\lambda}, t) = L(q, \dot{q}, t)$$

for all $\lambda$. In this formula, $q^{\lambda}$ is a known function of the variables $q$ and $\lambda$ (as defined by the
mapping $\mathcal{M}^{\lambda}$), but we have not yet said what we mean by $\dot{q}^{\lambda}$. This is however defined in
the following commonsense way: let $\lambda$ be fixed and let $q^{\lambda}$ be the image point of a typical
point $q$. Suppose now that the point $q$ has velocity $\dot{q}$ in the configuration space $Q$. This
motion of $q$ imparts a velocity to the image point $q^{\lambda}$, and it is this velocity that we call
$\dot{q}^{\lambda}$. This definition is expressed by the formula

$$\dot{q}_k^{\lambda} = \sum_{j=1}^{n} \frac{\partial q_k^{\lambda}}{\partial q_j} \, \dot{q}_j \tag{12.33}$$

from which we see that $\dot{q}^{\lambda}$ is a known function of the variables $q$, $\dot{q}$ and $\lambda$.

The formal statement and proof of Noether’s theorem are as follows: 

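Theorem 12.3 Noether’s theorem Let $\mathcal{S}$ be a holonomic mechanical system with
Lagrangian $L(q, \dot{q}, t)$ and let $\{\mathcal{M}^{\lambda}\}$ be a one-parameter family of mappings that have
the action

$$q \xrightarrow{\;\mathcal{M}^{\lambda}\;} q^{\lambda} \tag{12.34}$$

where $q^{\lambda} = q$ when $\lambda = 0$. If the mappings $\{\mathcal{M}^{\lambda}\}$ leave $L$ invariant in the sense that

$$L(q^{\lambda}, \dot{q}^{\lambda}, t) = L(q, \dot{q}, t) \tag{12.35}$$

for all $\lambda$, then the quantity

$$\sum_{j=1}^{n} p_j \left[ \frac{\partial q_j^{\lambda}}{\partial \lambda} \right]_{\lambda = 0} \tag{12.36}$$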
It follows that

$$\frac{d}{dt} \left( \sum_{j=1}^{n} p_j \left[ \frac{\partial q_j^{\lambda}}{\partial \lambda} \right]_{\lambda = 0} \right) = 0$$

and this proves the theorem. ■


The importance of Noether’s theorem lies in the general notion that an invariance of 
the Lagrangian gives rise to a constant of the motion. Such invariance properties are of 
great importance when the Lagrangian formalism is extended to continuous systems and 
fields. For more details, see Goldstein [4] who will tell you more about Noether’s theorem 
than you wish to know! 
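As a hedged illustration (added here, not part of the original text), the quantity (12.36), namely $\sum_j p_j [\partial q_j^{\lambda}/\partial\lambda]_{\lambda=0}$, can be evaluated for the two systems already discussed. The mappings below are the obvious translation and rotation. Each leaves the corresponding Lagrangian unchanged, since it shifts only a coordinate that does not appear in $L$ and leaves the velocities unaltered.

```latex
% Wedge and block (Example 12.17): translation in the x-direction
(x, y) \;\longrightarrow\; (x + \lambda,\; y), \qquad
\left[\frac{\partial x^{\lambda}}{\partial\lambda}\right]_{\lambda=0} = 1, \quad
\left[\frac{\partial y^{\lambda}}{\partial\lambda}\right]_{\lambda=0} = 0
\quad\Longrightarrow\quad
\sum_{j} p_j \left[\frac{\partial q_j^{\lambda}}{\partial\lambda}\right]_{\lambda=0} = p_x .

% Spherical pendulum (Example 12.18): rotation about the vertical axis
(\theta, \phi) \;\longrightarrow\; (\theta,\; \phi + \lambda), \qquad
\left[\frac{\partial \theta^{\lambda}}{\partial\lambda}\right]_{\lambda=0} = 0, \quad
\left[\frac{\partial \phi^{\lambda}}{\partial\lambda}\right]_{\lambda=0} = 1
\quad\Longrightarrow\quad
\sum_{j} p_j \left[\frac{\partial q_j^{\lambda}}{\partial\lambda}\right]_{\lambda=0}
= p_\phi = m a^2 \sin^2\theta \, \dot{\phi} .
```

In both cases Noether's theorem reproduces the conserved momentum conjugate to the cyclic coordinate, which suggests that the cyclic-coordinate result is the simplest special case of the theorem.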


Problems on Chapter 12 

Answers and comments are at the end of the book.

Harder problems carry a star (*). 

Conservative systems 

12.1 A bicycle chain consists of N freely jointed links forming a closed loop. The chain can
slide freely on a smooth horizontal table. How many degrees of freedom has the chain? How
many conserved quantities are there in the motion? What is the maximum number of links the
chain can have for its motion to be determined by conservation principles alone?

12.2 Attwood’s machine A uniform circular pulley of mass 2m can rotate freely about its
axis of symmetry which is fixed in a horizontal position. Two masses m, 3m are connected by
a light inextensible string which passes over the pulley without slipping. The whole system
undergoes planar motion with the masses moving vertically. Take the rotation angle of the
pulley as generalised coordinate and obtain Lagrange’s equation for the motion. Deduce the
upward acceleration of the mass m.

12.3 Double Attwood machine A light pulley can rotate freely about its axis of symmetry 
which is fixed in a horizontal position. A light inextensible string passes over the pulley. At one 
end the string carries a mass 4m, while the other end supports a second light pulley. A second 
string passes over this pulley and carries masses m and 4m at its ends. The whole system 
undergoes planar motion with the masses moving vertically. Find Lagrange’s equations and 
deduce the acceleration of each of the masses. 

12.4 The swinging door A uniform rectangular door of width 2a can swing freely on its
hinges. The door is misaligned and the line of the hinges makes an angle $\alpha$ with the upward
vertical. Take the rotation angle of the door from its equilibrium position as generalised
coordinate and obtain Lagrange’s equation for the motion. Deduce the period of small oscillations
of the door about the equilibrium position.




12.5 A uniform solid cylinder C with mass m and radius a rolls on the rough outer
surface of a fixed horizontal cylinder of radius b. In the motion, the axes of the two
cylinders remain parallel to each other. Let $\theta$ be the angle between the plane containing
the cylinder axes and the upward vertical. Taking $\theta$ as generalised coordinate,
obtain Lagrange’s equation and verify that it is equivalent to the energy conservation
equation.

Initially the cylinder C is at rest on top of the fixed cylinder when it is given a very small
disturbance. Find, as a function of $\theta$, the normal component of the reaction force exerted on
C. Deduce that C will leave the fixed cylinder when $\theta = \cos^{-1}(4/7)$. Is the assumption that
rolling persists up to this moment realistic?

12.6 A uniform disk of mass M and radius a can roll along a rough horizontal rail. A particle
of mass m is suspended from the centre C of the disk by a light inextensible string of
length b. The whole system moves in the vertical plane through the rail. Take as generalised
coordinates $x$, the horizontal displacement of C, and $\theta$, the angle between the string and the
downward vertical. Obtain Lagrange’s equations. Show that $x$ is a cyclic coordinate and find
the corresponding conserved momentum $p_x$. Is $p_x$ the horizontal linear momentum of the
system?

Given that $\theta$ remains small in the motion, find the period of small oscillations of the
particle.

12.7 A uniform ball of mass m rolls down a rough wedge of mass M and angle $\alpha$, which
itself can slide on a smooth horizontal table. The whole system undergoes planar motion.
How many degrees of freedom has this system? Obtain Lagrange’s equations. For the special
case in which M = 3m/2, find (i) the acceleration of the wedge, and (ii) the acceleration of
the ball relative to the wedge.

12.8 A rigid rod of length 2a has its lower end in contact with a smooth horizontal floor.
Initially the rod is at an angle $\alpha$ to the upward vertical when it is released from rest. The
subsequent motion takes place in a vertical plane. Take as generalised coordinates $x$, the
horizontal displacement of the centre of the rod, and $\theta$, the angle between the rod and the
upward vertical. Obtain Lagrange’s equations. Show that $x$ remains constant in the motion
and verify that the $\theta$-equation is equivalent to the energy conservation equation.

* Find, in terms of the angle $\theta$, the reaction exerted on the rod by the floor.


Moving constraints 

12.9 A particle P is connected to one end of a light inextensible string which passes through
a small hole O in a smooth horizontal table and extends below the table in a vertical straight
line. P slides on the upper surface of the table while the string is pulled downwards from
below in a prescribed manner. (Suppose that the length of the horizontal part of the string is
$R(t)$ at time $t$.) Take $\theta$, the angle between OP and some fixed reference line in the table, as
generalised coordinate and obtain Lagrange’s equation. Show that $\theta$ is a cyclic coordinate and
find (and identify) the corresponding conserved momentum $p_\theta$. Why is the kinetic energy not
conserved?

If the constant value of $p_\theta$ is $mL$, find the tension in the string at time $t$.
potentials that generate the field $\{\mathbf{E}, \mathbf{B}\}$ by the formulae

$$\mathbf{E} = -\operatorname{grad} \phi - \frac{\partial \mathbf{A}}{\partial t}, \qquad \mathbf{B} = \operatorname{curl} \mathbf{A}.$$

Show that the potentials $\phi = 0$, $\mathbf{A} = tz\,\mathbf{i}$ generate a field $\{\mathbf{E}, \mathbf{B}\}$ that satisfies all four
Maxwell equations in free space. A particle of mass m and charge e moves in this field. Find
the Lagrangian of the particle in terms of Cartesian coordinates. Show that $x$ and $y$ are cyclic
coordinates and find the conserved momenta $p_x$, $p_y$.


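12.16* Relativistic Lagrangian The relativistic Lagrangian for a particle of rest mass $m_0$
moving along the $x$-axis under the simple harmonic potential field $V = \tfrac{1}{2} m_0 \Omega^2 x^2$ is given by

$$L = m_0 c^2 \left[ 1 - \left( 1 - \frac{\dot{x}^2}{c^2} \right)^{1/2} \right] - \tfrac{1}{2} m_0 \Omega^2 x^2.$$

Obtain the energy integral for this system and show that the period of oscillations of amplitude
$a$ is given by

$$\tau = \frac{4}{\Omega} \int_0^{\pi/2} \frac{1 + \tfrac{1}{2} \epsilon^2 \cos^2\theta}{\left( 1 + \tfrac{1}{4} \epsilon^2 \cos^2\theta \right)^{1/2}} \, d\theta,$$

where the dimensionless parameter $\epsilon = \Omega a / c$.

Deduce that

$$\tau = \frac{2\pi}{\Omega} \left[ 1 + \tfrac{3}{16} \epsilon^2 + O(\epsilon^4) \right]$$

when $\epsilon$ is small.

Conservation principles and symmetry
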


12.17 A particle of mass m moves under the gravitational attraction of a fixed mass M situated
at the origin. Take polar coordinates $r$, $\theta$ as generalised coordinates and obtain Lagrange’s
equations. Show that $\theta$ is a cyclic coordinate and find (and identify) the conserved momentum
$p_\theta$.


12.18 A particle P of mass m slides on the smooth inner surface of a circular cone of
semi-angle $\alpha$. The axis of symmetry of the cone is vertical with the vertex O pointing downwards.
Take as generalised coordinates $r$, the distance OP, and $\phi$, the azimuthal angle about the
vertical through O. Obtain Lagrange’s equations. Show that $\phi$ is a cyclic coordinate and find
(and identify) the conserved momentum $p_\phi$.

12.19 A particle of mass m and charge e moves in the magnetic field produced by a current
I flowing in an infinite straight wire that lies along the z-axis. The vector potential $\mathbf{A}$ of the
induced magnetic field is given by

$$A_r = A_\theta = 0, \qquad A_z = -\left( \frac{\mu_0 I}{2\pi} \right) \ln r,$$

where $r$, $\theta$, $z$ are cylindrical polar coordinates. Find the Lagrangian of the particle. Show that
$\theta$ and $z$ are cyclic coordinates and find the corresponding conserved momenta.

12.20 A particle moves freely in the gravitational field of a fixed mass distribution. Find 
the conservation principles that correspond to the symmetries of the following fixed mass 
distributions: (i) a uniform sphere, (ii) a uniform half plane, (iii) two particles, (iv) a uniform 
right circular cone, (v) an infinite uniform circular cylinder. 

12.21* Helical symmetry A particle moves in a conservative field whose potential energy
V has helical symmetry. This means that V is invariant under the simultaneous operations (i)
a rotation through any angle $\alpha$ about the axis Oz, and (ii) a translation $c\alpha$ in the z-direction.
What conservation principle corresponds to this symmetry?

Computer assisted problem 

12.22 Upside-down pendulum A particle P is attached to a support S by a light rigid rod of
length a, which is freely pivoted at S. P moves in a vertical plane through S and at the same
time the support S is made to oscillate vertically having upward displacement $Z = \epsilon a \cos pt$
at time $t$. Take $\theta$, the angle between SP and the upward vertical, as generalised coordinate
and show that Lagrange’s equation is

$$\ddot{\theta} - \left( \Omega^2 - \epsilon p^2 \cos pt \right) \sin\theta = 0,$$

where $\Omega^2 = g/a$. The object is to show that, for suitable choices of the parameters, the
pendulum is stable in the vertically upwards position!

First write the equation in dimensionless form by introducing the dimensionless time $\tau = pt$.
Then $\theta(\tau)$ satisfies

$$\frac{d^2\theta}{d\tau^2} - \left( \frac{\Omega^2}{p^2} - \epsilon \cos\tau \right) \sin\theta = 0.$$

Solve this equation numerically with initial conditions in which the pendulum starts from rest
near the upward vertical. Plot the solution $\theta(\tau)$ as a function of $\tau$ for about twenty oscillations
of the support. Try $\epsilon = 0.3$ with increasing values of the parameter $p/\Omega$ in the range
$1 < p/\Omega < 10$. You will know that the upside-down pendulum is stable when $\theta$ remains small in
the subsequent motion.
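One possible way to set up the numerical experiment is sketched below. It is only an illustration and rests on an assumption that should be checked against your own derivation: the dimensionless equation is taken to be $d^2\theta/d\tau^2 = (\Omega^2/p^2 - \epsilon\cos\tau)\sin\theta$, which is what Lagrange's equation gives for the support displacement $Z = \epsilon a\cos pt$. The function name, the fourth-order Runge-Kutta scheme, the step size and the starting angle of 0.1 rad are arbitrary choices. The sketch returns the largest deflection reached instead of drawing a plot.

```python
import math

def max_deflection(p_over_omega, eps=0.3, theta0=0.1, cycles=20, steps_per_cycle=400):
    """Largest |theta| reached over the given number of support oscillations.

    Integrates theta'' = (Omega^2/p^2 - eps*cos(tau)) * sin(theta), starting from
    rest at theta = theta0, with the classical Runge-Kutta method.
    """
    k = 1.0 / p_over_omega**2  # Omega^2 / p^2

    def accel(tau, theta):
        return (k - eps * math.cos(tau)) * math.sin(theta)

    dt = 2.0 * math.pi / steps_per_cycle
    tau, theta, omega = 0.0, theta0, 0.0
    worst = abs(theta)
    for _ in range(cycles * steps_per_cycle):
        k1t, k1w = omega, accel(tau, theta)
        k2t, k2w = omega + 0.5 * dt * k1w, accel(tau + 0.5 * dt, theta + 0.5 * dt * k1t)
        k3t, k3w = omega + 0.5 * dt * k2w, accel(tau + 0.5 * dt, theta + 0.5 * dt * k2t)
        k4t, k4w = omega + dt * k3w, accel(tau + dt, theta + dt * k3t)
        theta += dt / 6.0 * (k1t + 2.0 * k2t + 2.0 * k3t + k4t)
        omega += dt / 6.0 * (k1w + 2.0 * k2w + 2.0 * k3w + k4w)
        tau += dt
        worst = max(worst, abs(theta))
    return worst

# Scan the driving frequency: small returned values indicate a stable inverted position.
for ratio in (2.0, 4.0, 6.0, 8.0, 10.0):
    print(ratio, max_deflection(ratio))
```

Collecting $\theta$ at every step instead of only its maximum gives the data for the plot requested in the problem.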

Even more surprisingly, it is possible to stabilise the double pendulum (or any multiple pendulum) in 
the upside-down position by vibrating the support. See Acheson [1] for photographs of a triple pendulum 
(and even a length of floppy wire) stabilised in the upside-down position by vibrating the support. However, 
the famous but elusive ‘Indian Rope Trick’, in which a small boy climbs up a self-supporting vertical rope, 
has yet to be demonstrated! 
The calculus of variations
and Hamilton’s principle


KEY FEATURES 

The key features of this chapter are integral functionals and the functions that make them 
stationary, the Euler-Lagrange equation and extremals, and the importance of variational 
principles. 


The notion that physical processes are governed by minimum principles is older than 
most of science. It is based on the long held belief that nature arranges itself in the 
most ‘economical’ way. Actually, many ‘minimum’ principles have, on closer inspection, 
turned out to make their designated quantity stationary, but not necessarily a minimum. 
As a result, they are now known to be variational principles, but they are no less important
because of this. A good example of a variational principle is Fermat’s principle of
geometrical optics, which was proposed in 1657 as Fermat's principle of least time in the 
form: 

Of all the possible paths that a light ray might take between two fixed points, the 
actual path is the one that minimises the travel time of the ray. 

Fermat showed that the laws of reflection and refraction could be derived from his 
principle, and proposed that the principle was true in general. Not only did Fermat’s 
principle ‘explain’ the known laws of optics, it was simple and elegant, and was capable 
of extending the laws of optics far beyond the results that led to its conception. This 
example explains why variational principles continue to be sought; it is because of their 
innate simplicity and elegance, and the generality of their application. 
The variational principle on which it is possible to base the whole of classical mechanics
was discovered by Hamilton* and is known as Hamilton’s principle.† In its original
form, it stated that:

Of all the kinematically possible motions that take a mechanical system from one 
given configuration to another within a given time interval, the actual motion is the one 
that minimises the time integral of the Lagrangian of the system. 

Lagrange’s equations of motion can be derived from Hamilton’s principle, which can 
therefore be taken as the basic postulate of classical mechanics, instead of Newton’s laws. 
More importantly however, Hamilton’s principle has had a far reaching influence on many 
areas of physics, where apparently non-mechanical systems (fields, for example) can be 
described in the language of classical mechanics, and their behaviour characterised by 
‘Lagrangians’. Hamilton’s principle is generally regarded as one of the most elegant and 
far reaching principles in physics. 

In order to get concrete results from a variational principle, it is usually necessary to 
convert it to a differential equation. This can be done by using the calculus of variations, 
which is concerned with minimising or maximising the value of an integral functional. The 
calculus of variations is a large subject and we develop only those aspects most relevant to 
interesting physical problems, and to the understanding and use of variational principles. 

13.1 SOME TYPICAL MINIMISATION PROBLEMS 

The calculus of variations arose from attempts to solve minimisation and maximisation
problems that occur naturally in physics and mathematics, but the scope of applications
has since widened greatly. We begin by describing three minimisation problems
taken from geometry, physics and economics respectively. Maximisation problems also 
occur, but these can be converted into minimisation problems merely by reversing the sign 
of the quantity to be maximised. Thus we lose no generality by presenting the theory for 
minimisation problems only. 

1. Shortest paths - geodesics A basic problem of the calculus of variations is that of 
finding the path of shortest length that connects two given points A and B. If the path 
has no constraints to satisfy, such as having to go round obstacles or lie on a given curved 


* Sir William Rowan Hamilton (1805-1865), was a great genius but an unhappy man. He was appointed
Professor of Astronomy at Trinity College Dublin at the age of twenty one, whilst still an undergraduate.
Much of his early work is on optics where he introduced the notion of the characteristic function. His
paper On a General Method in Dynamics, which contains what is now called Hamilton’s principle, was
presented to the Royal Irish Academy in 1834. He was knighted in 1835. However, his personal life was
as chaotic as his academic achievements were brilliant. He was frustrated in love, frequently depressed
and a heavy drinker; this culminated in his making an exhibition of himself at a meeting of the Irish
Geological Society. He spent the later years of his life working on the theory of quaternions, but they
were never the great discovery he had hoped for.

† Hamilton’s principle is sometimes called the principle of least action. The terminology in this area is
confusing, since another variational principle of mechanics, Maupertuis’s principle, is also referred to as
the principle of least action.
FIGURE 13.1 A soap film is stretched between two circular wires. It has the form of a surface of
revolution generated by rotating the curve $y = y(x)$ about the x-axis.

surface, the answer is well known; it is the straight line joining A and B. However, it is
still instructive to formulate this problem and to check that the calculus of variations does
yield the expected result.

Suppose that $A = (0, 0)$, $B = (1, 0)$ and that the general path in the $(x, y)$-plane
connecting these points is $y = y(x)$. Then the total length of the path is

$$L[y] = \int_0^1 \left[ 1 + \left( \frac{dy}{dx} \right)^2 \right]^{1/2} dx. \tag{13.1}$$

The problem is to find the function $y(x)$, satisfying the end conditions $y(0) = 0$, $y(1) = 0$,
that minimises the length $L$. This is the subject of Problem 13.3; the answer is indeed
the straight line $y = 0$.
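The length functional can be explored numerically before any theory is developed. The sketch below is an illustration only: it evaluates the arc-length integral $\int_0^1 [1 + (dy/dx)^2]^{1/2}\,dx$ by the midpoint rule for the one-parameter family of trial paths $y = c\sin\pi x$, a family chosen here simply because every member satisfies $y(0) = y(1) = 0$.

```python
import math

def path_length(y_prime, n=10000):
    """Midpoint-rule estimate of the integral from 0 to 1 of sqrt(1 + y'(x)**2) dx."""
    width = 1.0 / n
    return sum(math.sqrt(1.0 + y_prime((k + 0.5) * width) ** 2) * width for k in range(n))

def bump_slope(c):
    """Slope of the trial path y = c sin(pi x), which satisfies y(0) = y(1) = 0."""
    return lambda x: c * math.pi * math.cos(math.pi * x)

# c = 0 is the straight line; every other member of the family is longer.
lengths = {c: path_length(bump_slope(c)) for c in (0.0, 0.1, 0.5, 1.0)}
for c, value in lengths.items():
    print(c, value)
```

A scan like this can only compare paths within the chosen family. Showing that the straight line beats every admissible path is what the calculus of variations is for.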

In general, paths of shortest length are called geodesics. For example, geodesics on 
the surface of a sphere are great circles. Some surfaces (such as the cylinder and cone) are 
developable, which means that they can be rolled out flat without changing any lengths. 
The geodesic can be drawn while the surface is flat (it is now a straight line) and the 
surface can then be rolled back up again. In general however, surfaces are not developable 
and geodesics have to be found by the calculus of variations. 

2. The soap film problem Two rigid circular wires each of radius b have the same axis
of symmetry and are fixed at a distance 2a from each other. A soap film is created which
spans the two wires as shown in Figure 13.1. The soap film has the form of a surface of 
revolution with the two circular ends open. What is the shape of the soap film? 

Fortunately, we can formulate this problem without resorting to the theory of thin 
membranes! Since the air pressure is the same on either side of the film, and since the 
effect of gravity is negligible, surface tension is the dominant effect. The condition that 
the total energy be a minimum in equilibrium is therefore equivalent to the condition that 
the area of the film be a minimum. 

Let the film be the surface generated by rotating the curve $y = y(x)$ about the x-axis.
Then the surface area $A$ of the film is

$$A[y] = 2\pi \int_{-a}^{a} y \left[ 1 + \left( \frac{dy}{dx} \right)^2 \right]^{1/2} dx. \tag{13.2}$$
conditions have the form $x(a) = A$ and $x(b) = B$, where $A$, $B$ are given. It is convenient
to regard the function $x(t)$ that appears in (13.7) as being composed of $x^*(t)$ together with
a variation* $h(t)$ so that we may alternatively write

$$J[x^* + h] \ge J[x^*] \tag{13.8}$$

for all admissible variations $h(t)$. Since $x$ must satisfy the same end conditions as $x^*$, the
admissible variations are those for which $h(a) = h(b) = 0$ (see Figure 13.2).

Most readers will find the theory that follows quite difficult. To aid understanding, the 
argument is broken down into three separate steps. 


The variation in J and the meaning of ‘stationary’ 

The first step is to give a meaning to the statement that a function $x(t)$ makes the functional
$J[x]$ stationary.

Let $x^*(t)$ be any admissible function and $h(t)$ an admissible variation. When $h$ is a
small variation, we can estimate the corresponding variation in $J[x]$ by ordinary calculus,
as follows:

Let $t$ have any fixed value. Then $x^*$ and $\dot{x}^*$ are just real numbers and the variation in $F$
due to the variation $h$ in $x^*$ is†

$$F(x^* + h, \dot{x}^* + \dot{h}, t) - F(x^*, \dot{x}^*, t) = h \frac{\partial F}{\partial x}(x^*, \dot{x}^*, t) + \dot{h} \frac{\partial F}{\partial \dot{x}}(x^*, \dot{x}^*, t) + O(h^2 + \dot{h}^2),$$

when $h$ and $\dot{h}$ are both small. On integrating both sides of this equation with respect to $t$
over the interval $[a, b]$, the corresponding variation in $J$ is given by

$$J[x^* + h] - J[x^*] = \int_a^b \left[ h \frac{\partial F}{\partial x}(x^*, \dot{x}^*, t) + \dot{h} \frac{\partial F}{\partial \dot{x}}(x^*, \dot{x}^*, t) \right] dt + O(\|h\|^2), \tag{13.9}$$


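for small $\|h\|$, where $\|h\|$ is defined by

$$\|h\| = \max_{a \le t \le b} |h(t)| + \max_{a \le t \le b} |\dot{h}(t)|$$

and is called the norm‡ of $h$. (When $\|h\|$ is small, both $|h(t)|$ and $|\dot{h}(t)|$ are small throughout
the interval $[a, b]$.) The second term in the integrand of equation (13.9) can be integrated
by parts to give

$$\int_a^b \dot{h} \frac{\partial F}{\partial \dot{x}}(x^*, \dot{x}^*, t) \, dt = \left[ h \frac{\partial F}{\partial \dot{x}}(x^*, \dot{x}^*, t) \right]_{t=a}^{t=b} - \int_a^b h \frac{d}{dt} \left( \frac{\partial F}{\partial \dot{x}}(x^*, \dot{x}^*, t) \right) dt$$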




* These are the variations that give the ‘calculus of variations’ its name.

† The formula we are using is that, if $G(u, v)$ is a function of the independent variables $u$ and $v$, then the
variation in $G$ caused by the variations $u_0 \to u_0 + h$ and $v_0 \to v_0 + k$ is given by

$$G(u_0 + h, v_0 + k) - G(u_0, v_0) = h \frac{\partial G}{\partial u}(u_0, v_0) + k \frac{\partial G}{\partial v}(u_0, v_0) + O(h^2 + k^2),$$

for small $h$, $k$. Note that the partial derivatives are evaluated at the ‘starting point’ $(u_0, v_0)$.

‡ Do not be too concerned over the exact definition of $\|h\|$. It must be written in some such way for
mathematical correctness, but the only property that we will need is that $\|h\|$ is proportional to $h$ in the
sense that, if $h$ is multiplied by a constant $\lambda$, then so is $\|h\|$.
FIGURE 13.3 An interval $(c, d)$ in which $G(t) > 0$ and a
corresponding bump function $h(t)$. The integral of $G \times h$
must be positive.

for all admissible variations $h$. In particular, this inequality must remain true if $h$ is replaced by $-h$, but
this is only possible if the integral has the value zero.

Equation (13.10) therefore reduces to

$$J[x^* + h] - J[x^*] = O(\|h\|^2) \tag{13.15}$$

for small $\|h\|$. By definition, this means that $x^*$ makes $J[x]$ stationary. The same result
applies to functions that provide a local maximum for $J[x]$. Our result is summarised as
follows:

Functions that minimise or maximise J make J stationary

If the function $x^*$ provides a local minimum for the integral functional $J[x]$, then
$x^*$ makes $J$ stationary. The same applies to functions that provide a local maximum
for $J$.


Euler-Lagrange equation 

We will now obtain the differential equation that must be satisfied by a function $x^*(t)$ that
makes $J[x]$ stationary. This is the counterpart of the elementary condition $f'(x) = 0$. To
find this, we return to equation (13.12). In the integrand, the function inside the square
brackets looks complicated but it is just a function of $t$ and, if we denote it by $G(t)$, then

$$\int_a^b G(t) \, h(t) \, dt = 0 \tag{13.16}$$

for all admissible variations $h$. In fact, the only function $G(t)$ for which this is possible is
the zero function, that is, $G(t) = 0$ for $a < t < b$. The proof is as follows:

Suppose that $G(t)$ is not the zero function. Then there must exist some interval $(c, d)$, lying inside
the interval $(a, b)$ in which $G(t) \ne 0$ and thus has constant sign (positive, say). Take $h(t)$ to be a ‘bump
function’ (as shown in Figure 13.3), which is zero outside the interval $(c, d)$ and positive inside. For such a
choice of $h$,

$$\int_a^b G(t) \, h(t) \, dt = \int_c^d G(t) \, h(t) \, dt > 0,$$

since the integral of a positive function must be positive. This contradicts equation (13.16) and so $G$ must
be the zero function.


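We have therefore shown that

$$\frac{\partial F}{\partial x}(x^*, \dot{x}^*, t) - \frac{d}{dt} \left( \frac{\partial F}{\partial \dot{x}}(x^*, \dot{x}^*, t) \right) = 0$$

for $a < t < b$, which is the same as saying that $x^*$ must satisfy the Euler-Lagrange
differential equation

$$\frac{\partial F}{\partial x} - \frac{d}{dt} \left( \frac{\partial F}{\partial \dot{x}} \right) = 0.$$

The above argument is reversible and so the converse result is also true. Our result is
summarised as follows: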


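Euler-Lagrange equation

If the function $x^*$ makes the integral functional

$$J[x] = \int_a^b F(x, \dot{x}, t) \, dt \tag{13.17}$$

stationary, then $x^*$ must satisfy the Euler-Lagrange differential equation

$$\frac{d}{dt} \left( \frac{\partial F}{\partial \dot{x}} \right) - \frac{\partial F}{\partial x} = 0. \tag{13.18}$$

The converse result is also true.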
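As a brief illustration of the boxed result (a sketch with an integrand chosen here for illustration, not taken from the text), take $F = \tfrac{1}{2} m \dot{x}^2 - V(x)$, where $m$ is a constant and $V$ is a given function of $x$. Working out the two partial derivatives and substituting them into the Euler-Lagrange equation gives:

```latex
\frac{\partial F}{\partial \dot{x}} = m \dot{x}, \qquad
\frac{\partial F}{\partial x} = -V'(x)
\quad\Longrightarrow\quad
\frac{d}{dt}\left( m \dot{x} \right) + V'(x) = 0,
\quad\text{that is,}\quad
m \ddot{x} = -V'(x).
```

With this choice of $F$ the Euler-Lagrange equation has the same form as the equation of motion of a particle moving in one dimension under the potential energy $V(x)$, which is the connection that Hamilton's principle exploits.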


It is very convenient to give solutions of the Euler-Lagrange equation a special name. 

Definition 13.2 Extremals Any solution of the Euler-Lagrange equation is called an
extremal* of the functional $J[x]$.


Key results of the calculus of variations

• If the function $x^*$ minimises or maximises the functional $J$, then $x^*$ makes $J$
stationary and so $x^*$ must be an extremal of $J$.

• If $x^*$ is an extremal of $J$, then $x^*$ makes $J$ stationary, but it may not minimise or
maximise $J$.


* The term extremal should not be confused with extremum (plural: extrema). Extremum means maximum
or minimum. Thus a function that provides an extremum of $J$ must make $J$ stationary and so must be
an extremal of $J$ (that is, it must satisfy the Euler-Lagrange equation). The converse is not true. An
extremal may not provide a minimum or maximum of $J$. (Try reading this again!)
Fortunately, solving problems with the E-L equation is much easier than the preceding
theory! The E-L equation is a second order non-linear ODE, and so one needs a measure 
of luck when it comes to finding solutions. Nevertheless, many interesting cases can be 
solved in closed form. 


Example 13.1 Finding extremals 1 


Find the extremal of the functional

$$J[x] = \int_1^2 \frac{\dot{x}^2}{4t} \, dt$$

that satisfies the end conditions $x(1) = 5$ and $x(2) = 11$.

Solution

By definition, extremals are solutions of the E-L equation. In the present case, $F = \dot{x}^2 / 4t$ so that

$$\frac{\partial F}{\partial x} = 0, \qquad \frac{\partial F}{\partial \dot{x}} = \frac{\dot{x}}{2t},$$

and the E-L equation takes the form

$$\frac{d}{dt} \left( \frac{\dot{x}}{2t} \right) - 0 = 0.$$

On integrating, we obtain

$$x = ct^2 + d,$$

where $c$ and $d$ are constants of integration. The extremals of $J$ are therefore a family
of parabolas in the $(t, x)$-plane. The admissible extremals are those that satisfy the
prescribed end conditions $x(1) = 5$ and $x(2) = 11$. On applying these conditions, we
find that $c = 2$ and $d = 3$ so that the only admissible extremal of $J[x]$ is given by

$$x = 2t^2 + 3. \qquad \blacksquare$$


Question Maximum, minimum, or neither? 

Does the extremal $x = 2t^2 + 3$ maximise or minimise $J$?

Answer

The admissible extremal $x$ is known to make $J$ stationary. It may minimise $J$, maximise
$J$, or do neither. With the theory that we have at our disposal, we cannot
generally decide what happens. However, in a few simple cases (including this one),
we can decide very easily.
Let $h$ be any admissible variation (not necessarily small) and consider the variation
in $J$ that it produces, namely,

$$J[x + h] - J[x] = \int_1^2 \frac{(4t + \dot{h})^2}{4t} \, dt - \int_1^2 \frac{(4t)^2}{4t} \, dt = 2 \left[ h(2) - h(1) \right] + \int_1^2 \frac{\dot{h}^2}{4t} \, dt = \int_1^2 \frac{\dot{h}^2}{4t} \, dt,$$

since $h$ is an admissible variation satisfying $h(1) = h(2) = 0$. Hence

$$J[x + h] - J[x] = \int_1^2 \frac{\dot{h}^2}{4t} \, dt > 0,$$

since the integral of a positive function must be positive. Thus $x$ actually provides the
global minimum of $J[x]$. The global minimum value of $J$ is therefore $J[2t^2 + 3] = 6$. ■
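The conclusions of Example 13.1 are easy to test numerically. The following sketch is illustrative only: it estimates $J[x] = \int_1^2 \dot{x}^2/(4t)\,dt$ by the midpoint rule for the extremal $x = 2t^2 + 3$ and for a few varied paths $x + h$, using the assumed variation $h = c\sin\pi(t - 1)$, which vanishes at both end points.

```python
import math

def J(x_dot, n=20000):
    """Midpoint-rule estimate of the integral from 1 to 2 of x_dot(t)**2 / (4 t) dt."""
    width = 1.0 / n
    total = 0.0
    for k in range(n):
        t = 1.0 + (k + 0.5) * width
        total += x_dot(t) ** 2 / (4.0 * t) * width
    return total

def extremal_dot(t):
    """Derivative of the admissible extremal x = 2 t**2 + 3."""
    return 4.0 * t

def varied_dot(amplitude):
    """Derivative of x + h for the illustrative variation h = amplitude * sin(pi (t - 1))."""
    return lambda t: 4.0 * t + amplitude * math.pi * math.cos(math.pi * (t - 1.0))

baseline = J(extremal_dot)  # the text gives the exact value 6
increases = [J(varied_dot(c)) - baseline for c in (-1.0, -0.1, 0.1, 1.0)]
print(baseline, increases)
```

Every variation tried increases $J$, in line with the argument in the Answer, although a finite set of trials is of course no substitute for that argument.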


A useful integral of the Euler-Lagrange equation 

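Not all examples are as easy as the last one and the E-L equation often has a complicated
form. However, for the case in which the function $F(x, \dot{x}, t)$ has no explicit dependence
on $t$ (that is, $F = F(x, \dot{x})$), the second order E-L equation can always be integrated once
to yield a first order ODE. This offers a great simplification in many important problems.
Suppose that $F = F(x, \dot{x})$. Then it follows from the product rule and the chain rule
that

\[
\frac{d}{dt}\left( \dot{x}\,\frac{\partial F}{\partial \dot{x}} - F \right)
= \ddot{x}\,\frac{\partial F}{\partial \dot{x}} + \dot{x}\,\frac{d}{dt}\left( \frac{\partial F}{\partial \dot{x}} \right)
- \left( \frac{\partial F}{\partial x}\,\dot{x} + \frac{\partial F}{\partial \dot{x}}\,\ddot{x} \right)
= \dot{x}\left[ \frac{d}{dt}\left( \frac{\partial F}{\partial \dot{x}} \right) - \frac{\partial F}{\partial x} \right].
\]

Thus, if $x$ satisfies the E-L equation

\[
\frac{d}{dt}\left( \frac{\partial F}{\partial \dot{x}} \right) - \frac{\partial F}{\partial x} = 0,
\]

it follows that $x$ satisfies the first order equation

\[
\dot{x}\,\frac{\partial F}{\partial \dot{x}} - F = c, \tag{13.20}
\]

for some choice of the constant $c$. Conversely, if $x$ is any non-constant solution of equation
(13.20) then it satisfies the E-L equation.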
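
As a quick illustration of why the restriction to non-constant solutions matters, consider the integrand $F = \dot{x}^2 - x^2$. This integrand is chosen here purely for simplicity and is not one of the examples in the text. The E-L equation and the first order equation (13.20) become

\[
\ddot{x} + x = 0, \qquad \dot{x}\,(2\dot{x}) - (\dot{x}^2 - x^2) = \dot{x}^2 + x^2 = c .
\]

Differentiating the first order equation gives $2\dot{x}(\ddot{x} + x) = 0$, so any solution with $\dot{x} \neq 0$ satisfies the E-L equation. The constant function $x = x_0$ with $x_0 \neq 0$ satisfies the first order equation (with $c = x_0^2$) but not the E-L equation, since $\ddot{x} + x = x_0 \neq 0$.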




Chapter 13 The calculus of variations and Hamilton’s principle 


The solutions are*

\[ \pm (C^2 - x^2)^{1/2} = t + D, \]

where $D$ is a constant of integration. Hence the extremals of $J$ are (the upper halves
of) the family of circles

\[ x^2 + (t + D)^2 = C^2 \]

in the $(t, x)$-plane. On applying the given end conditions, we find that $C = 5$ and
$D = -3$, so that the only admissible extremal is an arc of the circle with centre
$(3, 0)$ and radius 5, namely

\[ x = +\sqrt{16 + 6t - t^2} \qquad (0 \le t \le 7). \]

Since there is only one admissible extremal, it follows that, if it were known that
a minimising (or maximising) function existed, then this must be it. However, we
have no such knowledge and no means of deciding whether $x$ provides a minimum or
maximum for $J$, or neither. (It actually provides the global minimum of $J$.) ■

Our final example is the famous brachistochrone problem.†


Example 13.3 The brachistochrone (shortest time) problem 


Two fixed points P and Q are connected by a smooth wire lying in the vertical plane 
that contains P and Q. A particle is released from rest at P and slides, under uniform 
gravity, along the wire to Q. What shape should the wire be so that the transfer is 
completed in the shortest time? 


Solution 

Suppose that the wire lies in the $(x, z)$-plane, with $Oz$ pointing vertically downwards,
with $P$ at the origin, and $Q$ at the point $(a, b)$. Let the shape of the wire be given by
the curve $z = z(x)$. Then, since the particle is released from rest when $z = 0$, energy
conservation implies that the speed of the particle when its downward displacement
is $z$ is $(2gz)^{1/2}$. The time $T$ taken for the particle to complete the transfer is therefore

\[
T[z] = (2g)^{-1/2} \int_0^a \frac{(1 + \dot{z}^2)^{1/2}}{z^{1/2}}\,dx. \tag{13.22}
\]


* It is evident that these equations also admit the constant solution $x = C$. However, it may be verified that
the E-L equation for this problem has no constant solutions.

† This famous minimisation problem was posed in 1696 by Johann Bernoulli (who had already found
the solution) as a not-so-friendly challenge to his mathematical contemporaries. Solutions were found
by Jacob Bernoulli, de l’Hôpital, Leibnitz, and Newton, who (according to his publicity manager) had
the answer within a day. Newton published his solution anonymously, but Johann Bernoulli identified
Newton as the author, declaring ‘one can recognise the lion by the marks of his claw’. The last word goes
to Newton. He complained ‘I do not love to be pestered and teased by foreigners about mathematical
things’.




FIGURE 13.4 The curve that minimises T[z] is an arc of a cycloid. 


Since there is only one admissible extremal, it must be the minimising curve for
$T[z]$, provided that a minimising curve exists at all. In view of the physical origin of
the problem, most of us are happy to take this for granted, but purists may sleep more
soundly in the knowledge that this can also be proved mathematically. ■
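
To get a feel for the size of the saving, the sketch below compares the descent time along a straight wire with that along a cycloid joining the same two points. It assumes the standard cycloid parametrisation $x = c(\theta - \sin\theta)$, $z = c(1 - \cos\theta)$, for which the descent time from rest to parameter value $\theta$ is $\theta (c/g)^{1/2}$. The value of $g$, the end point, and all function names are illustrative assumptions.

```python
import math

G = 9.81  # assumed value of g in SI units

def descent_time(z, dz_dx, a, g=G, n=20000):
    """Midpoint-rule estimate of T[z] = (2g)^(-1/2) * integral_0^a sqrt((1 + z'^2) / z) dx.

    The substitution x = a * u**2 removes the integrable singularity at x = 0,
    where z = 0 and the particle is momentarily at rest.
    """
    du = 1.0 / n
    total = 0.0
    for k in range(n):
        u = (k + 0.5) * du
        x = a * u * u
        total += math.sqrt((1.0 + dz_dx(x) ** 2) / z(x)) * 2.0 * a * u * du
    return total / math.sqrt(2.0 * g)

# End point Q = (a, b) chosen to lie at the lowest point (theta = pi) of a cycloid
# with generating radius c.
c = 1.0
a, b = c * math.pi, 2.0 * c

# Straight wire z = (b/a) x: numerical value and the closed form it should match.
slope = b / a
t_line = descent_time(lambda x: slope * x, lambda x: slope, a)
t_line_exact = math.sqrt(2.0 * (a * a + b * b) / (G * b))

# Cycloid: descent time to parameter value theta = pi.
t_cycloid = math.pi * math.sqrt(c / G)

print(t_line, t_line_exact, t_cycloid)
```

For this end point the cycloid should come out noticeably quicker than the straight wire, even though the straight wire is the shorter path.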


13.3 VARIATIONAL PRINCIPLES 

The laws of physics are usually formulated in terms of variables or fields that
satisfy differential equations. Thus, the generalised coordinates $q(t)$ of a mechanical
system satisfy Lagrange’s equations (a system of ODEs), the electromagnetic field satisfies
Maxwell’s equations (a system of PDEs), the wave function of quantum mechanics
satisfies Schrödinger’s wave equation, and so on. But there is an alternative way of expressing
these laws in terms of variational principles. In the variational approach, the actual
physical behaviour of the system is distinguished by the fact that it makes a certain
integral functional stationary. Thus all of the physics is somehow contained in the integrand
of this functional!

Expressing physical laws in variational form does not make it any easier to solve 
problems. Indeed, problems will continue to be solved by using differential equations. 
The virtue of the variational formulation is that it is much easier to extend existing theory 
to new situations. For example, the theory of fields can be developed in the language of 
classical mechanics by using the variational formulation. 

Fermat’s principle 

These ideas are nicely illustrated by an example most readers will be familiar with:
the paths of light rays in geometrical optics. When travelling through a homogeneous
medium, light rays travel in straight lines. But, on meeting a plane interface between two
different homogeneous media, the ray is either reflected or suffers a sharp change of
direction called refraction, as shown in Figure 13.5. This change of direction is governed by
Snell’s law of refraction $n_1 \sin\theta_1 = n_2 \sin\theta_2$, where $n_1$ and $n_2$ are the refractive indices
of the two media, and $\theta_1$ and $\theta_2$ are the angles that the ray makes with the normal to the
interface. In terms of the angle $\psi$ used in Figure 13.5, Snell’s law takes the form

\[ n_1 \cos\psi_1 = n_2 \cos\psi_2 = n_3 \cos\psi_3. \]

FIGURE 13.5 When a light ray passes between homogeneous media (left), it satisfies Snell’s law
$n_1 \cos\psi_1 = n_2 \cos\psi_2 = n_3 \cos\psi_3$. In the continuous case with $n = n(y)$ (right), Snell’s law
becomes $n \cos\psi = \text{constant}$.


In the more general case of an inhomogeneous medium in which $n$ varies continuously
in the $y$-direction, one would then expect curved rays that satisfy Snell’s law in the form

\[ n \cos\psi = \text{constant}. \]

A variational principle consistent with these rules was proposed by Fermat in 1657 
and became known as Fermat’s principle of least time. This stated that: 

Of all the possible paths that a light ray might take between two fixed points, the 
actual path is the one that minimises the travel time of the ray. 

Fermat showed that his principle implied the truth of the laws of reflection and
refraction (as well as predicting straight rays in a homogeneous medium). Fermat’s original
principle is a beautifully simple and general statement about the paths taken by light rays
but, sadly, it is not quite correct. The correct version is as follows:


Fermat’s principle 

The actual path taken by a light ray between two fixed points makes the travel time 
of the ray stationary. 


The difference between the original and correct versions is that the path taken by the
ray does not necessarily make the travel time a minimum, but it does make the travel time
stationary.* In practice, the travel time is usually a minimum, but there are exceptional
cases where it is not.

* Surprisingly, incorrect statements about Fermat’s principle abound in the literature. It is often claimed
that ‘the path of a light ray makes the travel time a minimum or (occasionally) a maximum’. This is
untrue. The path of a ray can never make $T$ a maximum. It usually makes $T$ a minimum, occasionally it
provides neither a minimum nor a maximum, but it never provides a maximum.

If the free-space speed of light is $c$, then the speed of light at a point of a medium
where the refractive index is $n$ is $c/n$. A (hypothetical) path $\mathcal{P}$ in the medium would
therefore be traversed in time $T$ given by the line integral

\[ T = c^{-1} \int_{\mathcal{P}} n\,ds. \tag{13.23} \]

Since paths that make $T$ stationary are extremals of $T$ we can restate Fermat’s principle in
the elegant form:


Fermat’s principle - Classy version 


The paths of light rays in a medium are the same as the extremals of the functional 
T for that medium. 


Suppose that the refractive index $n$ in the medium depends only on $y$ (as in Figure
13.5) and consider rays that lie in the $(x, y)$-plane. Then a ray that connects the points
$(x_0, y_0)$ and $(x_1, y_1)$ must be an extremal of the functional $T$ which, in Cartesian
coordinates, takes the form

\[ T[y] = c^{-1} \int_{x_0}^{x_1} n\,(1 + \dot{y}^2)^{1/2}\,dx, \tag{13.24} \]

where $\dot{y}$ means $dy/dx$, and $n = n(y)$. Since $n$ does not depend upon $x$, we may use the
integrated form (13.21) of the E-L equation, which gives

\[ \frac{n}{(1 + \dot{y}^2)^{1/2}} = \text{constant}. \]

If we write $\dot{y} = \tan\psi$, where $\psi$ is the angle between the tangent to the ray and the $x$-axis
(see Figure 13.5), then this equation becomes

\[ n \cos\psi = \text{constant}, \]

exactly as anticipated from Snell’s law for layered media.
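
The continuous form of Snell’s law is easy to test on a concrete profile. The sketch below uses the refractive index $n = n_0(1 + z/a)^{1/2}$ and the parabolic ray $z = x^2/4a$ that appear in Problem 13.8, with the depth $z$ playing the role of the stratified coordinate. The numerical values of $n_0$ and $a$ and the function names are illustrative assumptions.

```python
import math

N0 = 1.3   # hypothetical surface refractive index
A = 2.0    # hypothetical length scale of the index profile

def refractive_index(z):
    """Index profile n = n0 * (1 + z/a)**(1/2), depending on depth z only."""
    return N0 * math.sqrt(1.0 + z / A)

def ray_depth(x):
    """Candidate ray z = x**2 / (4 a), horizontal at the origin."""
    return x * x / (4.0 * A)

def ray_slope(x):
    """dz/dx along the candidate ray."""
    return x / (2.0 * A)

def snell_invariant(x):
    """n * cos(psi), where psi is the angle between the ray tangent and the x-axis."""
    psi = math.atan(ray_slope(x))
    return refractive_index(ray_depth(x)) * math.cos(psi)

# Sample the invariant at several points along the ray.
values = [snell_invariant(0.5 * k) for k in range(9)]
print(values)
```

The sampled values should all equal $n_0$ to rounding error, so the parabola does satisfy $n \cos\psi = \text{constant}$. The horizontal line $z = 0$ satisfies the same relation, which is the puzzle raised in the question that follows.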

Question A puzzle 

When $n = n(y)$, it is easy to verify that the straight lines $y = \text{constant}$ are not
extremals of $T[y]$ and are therefore not rays (although they do satisfy Snell’s law!).
But since such a ‘ray’ would experience a constant value of $n$, how does the ray know
that it must bend?

Answer 

Your physics lecturer will be pleased to answer this question. ■ 

The variational approach really comes into its own however when we extend our
theory to other inhomogeneous media, where the correct generalisation of Snell’s law is
difficult to spot. There is no such difficulty with the variational approach; Fermat’s principle
still holds. The case of a light ray propagating in an axially symmetric medium is solved
in Problem 13.9. From Fermat’s principle, this is quite straightforward, but, starting from
Snell’s law, one would probably guess the wrong formula!

FIGURE 13.6 Hamilton’s principle Of all the kinematically possible trajectories of a
system that connect the configurations $q = q_0$ and $q = q_1$ in the time interval
$[t_0, t_1]$, the actual motion $q^*(t)$ makes the action functional of the system stationary.
[The figure marks the end points $(t_0, q_0)$ and $(t_1, q_1)$.]

13.4 HAMILTON’S PRINCIPLE


Hamilton's principle is the variational principle that is equivalent to Lagrange’s 
equations of motion. The comparison with geometrical optics is that Hamilton’s principle 
corresponds to Lagrange’s equations as Fermat’s principle corresponds to Snell’s law. We 
consider first the special case of systems with one degree of freedom. 

Systems with one degree of freedom 

Consider a Lagrangian system with a single generalised coordinate $q$ and Lagrangian
$L(q, \dot{q}, t)$. Then the trajectory $q^*(t)$ is an actual motion of the system if, and only if, it
satisfies the Lagrange equation

\[ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}} \right) - \frac{\partial L}{\partial q} = 0. \tag{13.25} \]

It is impossible not to notice that equation (13.25) is the Euler-Lagrange equation one
would get by making stationary the functional $S[q]$ defined by

\[ S[q] = \int_{t_0}^{t_1} L(q, \dot{q}, t)\,dt. \tag{13.26} \]

The scalar quantity $S$ is called the action and the functional $S[q]$ is called the action
functional corresponding to the Lagrangian $L$ (for the time interval $[t_0, t_1]$).

From this simple observation, it follows that $q^*(t)$ is an actual motion of the system
if, and only if, it makes the action functional $S[q]$ stationary. The situation is as shown
in Figure 13.6. This is Hamilton’s principle for a mechanical system with one degree of
freedom.

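Example 13.4 Hamilton’s principle

A certain oscillator with generalised coordinate $q$ has Lagrangian

\[ L = \tfrac{1}{2}\dot{q}^2 - \tfrac{1}{2}q^2. \]

Verify that $q^* = \sin t$ is a motion of the oscillator, and show directly that it makes the
action functional $S[q]$ stationary in any time interval $[0, \tau]$.

Solution

Lagrange’s equation corresponding to the Lagrangian $L = \tfrac{1}{2}\dot{q}^2 - \tfrac{1}{2}q^2$ is

\[ \ddot{q} + q = 0. \]

Since $q^*$ satisfies this equation, it is a motion of the oscillator.
Let $h(t)$ be an admissible variation. Then

\[
\begin{aligned}
S[q^* + h] - S[q^*]
&= \tfrac{1}{2} \int_0^\tau \left[ (\cos t + \dot{h})^2 - (\sin t + h)^2 - \cos^2 t + \sin^2 t \right] dt \\
&= \tfrac{1}{2} \int_0^\tau \left( 2\dot{h}\cos t + \dot{h}^2 - 2h\sin t - h^2 \right) dt \\
&= \bigl[ h \cos t \bigr]_0^\tau + \tfrac{1}{2} \int_0^\tau \left( \dot{h}^2 - h^2 \right) dt \\
&= \tfrac{1}{2} \int_0^\tau \left( \dot{h}^2 - h^2 \right) dt,
\end{aligned}
\]

since $h(0) = h(\tau) = 0$. It follows that

\[
\bigl| S[q^* + h] - S[q^*] \bigr|
\le \tfrac{1}{2}\tau \left( \max_{0 \le t \le \tau} |\dot{h}(t)|^2 + \max_{0 \le t \le \tau} |h(t)|^2 \right)
\le \tfrac{1}{2}\tau \left( \max_{0 \le t \le \tau} |h(t)| + \max_{0 \le t \le \tau} |\dot{h}(t)| \right)^2
= \tfrac{1}{2}\tau\,\|h\|^2 .
\]

Hence

\[ S[q^* + h] - S[q^*] = O\left( \|h\|^2 \right), \]

which, by definition, means that $q^*$ makes the action functional $S[q]$ stationary. [It
may not make $S[q]$ a minimum.] ■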
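
The quadratic behaviour of the variation can also be seen numerically. The sketch below takes the oscillator Lagrangian $L = \tfrac{1}{2}\dot{q}^2 - \tfrac{1}{2}q^2$ with $q^* = \sin t$, and perturbs it by $\epsilon h$ with $h = \sin(\pi t/\tau)$. The choices of $\tau$, $h$, and the function names are illustrative assumptions.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    step = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * f(a + k * step)
    return total * step / 3.0

def action(q, q_dot, tau):
    """S[q] = integral_0^tau (q_dot**2 / 2 - q**2 / 2) dt."""
    return simpson(lambda t: 0.5 * q_dot(t) ** 2 - 0.5 * q(t) ** 2, 0.0, tau)

tau = 2.0
# Illustrative admissible variation: h(0) = h(tau) = 0.
h = lambda t: math.sin(math.pi * t / tau)
h_dot = lambda t: (math.pi / tau) * math.cos(math.pi * t / tau)

S_star = action(math.sin, math.cos, tau)

def delta_S(eps):
    """S[q* + eps h] - S[q*] for the actual motion q* = sin t."""
    return action(lambda t: math.sin(t) + eps * h(t),
                  lambda t: math.cos(t) + eps * h_dot(t), tau) - S_star

# If the linear term vanishes, delta_S / eps**2 should not depend on eps.
ratios = [delta_S(eps) / eps ** 2 for eps in (0.1, 0.01, 0.001)]
predicted = 0.5 * simpson(lambda t: h_dot(t) ** 2 - h(t) ** 2, 0.0, tau)
print(ratios, predicted)
```

The ratio stays fixed as $\epsilon$ shrinks, which is what one expects when the variation in $S$ has no linear part. Taking a longer interval (for instance $\tau = 4$, so that $\pi/\tau < 1$) makes the same ratio negative, which illustrates the closing remark that $q^*$ need not minimise $S$.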


Systems with many degrees of freedom 

Hamilton’s principle can be extended to systems with any number of degrees of freedom.
In this more general case, the system has generalised coordinates $\mathbf{q} = (q_1, q_2, \ldots, q_n)$,
the Lagrangian has the form $L = L(\mathbf{q}, \dot{\mathbf{q}}, t)$, and Lagrange’s equations of motion are the
$n$ simultaneous equations

\[
\frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_j} \right) - \frac{\partial L}{\partial q_j} = 0 \qquad (1 \le j \le n). \tag{13.27}
\]

The action functional is now defined to be:

Definition 13.3 Action functional The functional

\[
S[\mathbf{q}] = \int_{t_0}^{t_1} L(\mathbf{q}, \dot{\mathbf{q}}, t)\,dt \tag{13.28}
\]

is called the action functional corresponding to the Lagrangian $L(\mathbf{q}, \dot{\mathbf{q}}, t)$ (for the time
interval $[t_0, t_1]$).




The notation $S[\mathbf{q}]$ is really a shorthand form for $S[q_1, q_2, \ldots, q_n]$ so that now
there are $n$ functions that can be varied by the $n$ independent variations $h_1, h_2, \ldots, h_n$
respectively. In the vector notation, such a variation is denoted by $\mathbf{h}$, where
$\mathbf{h} = (h_1, h_2, \ldots, h_n)$. The theory that we have developed does not cover the case where the
functional has more than one ‘independent variable’, but it can be extended to do so. An
outline of this extension is as follows:

Consider the general situation in which

\[ J[\mathbf{x}] = \int_a^b F(\mathbf{x}, \dot{\mathbf{x}}, t)\,dt, \]

where the vector function $\mathbf{x}(t) = (x_1(t), x_2(t), \ldots, x_n(t))$. By using the same argument as before, the
variation in $J$ caused by the admissible* variation $\mathbf{h}$ in $\mathbf{x}^*$ is found to be

\[
J[\mathbf{x}^* + \mathbf{h}] - J[\mathbf{x}^*]
= \sum_{j=1}^{n} \int_a^b \left[ \frac{\partial F}{\partial x_j} - \frac{d}{dt}\left( \frac{\partial F}{\partial \dot{x}_j} \right) \right] h_j\,dt + O\left( \|\mathbf{h}\|^2 \right),
\]

where the partial derivatives are evaluated at $(\mathbf{x}^*, \dot{\mathbf{x}}^*, t)$ and
$\|\mathbf{h}\|^2 = \|h_1\|^2 + \|h_2\|^2 + \cdots + \|h_n\|^2$. This variation is linear in $\mathbf{h}$ with an error term of order
$\|\mathbf{h}\|^2$. As before we say that $\mathbf{x}^*$ makes $J[\mathbf{x}]$ stationary if the linear term is zero, leaving only the error
term.

* The vector variation $\mathbf{h}$ is admissible if $\mathbf{h}(a) = \mathbf{h}(b) = \mathbf{0}$, that is, if $h_1, h_2, \ldots, h_n$ are all admissible.

Definition 13.4 Stationary J The vector function $\mathbf{x}^*(t)$ is said to make the functional
$J[\mathbf{x}]$ stationary if

\[ J[\mathbf{x}^* + \mathbf{h}] - J[\mathbf{x}^*] = O\left( \|\mathbf{h}\|^2 \right) \]

when $\|\mathbf{h}\|$ is small.

If $\mathbf{x}^*$ makes the functional $J[\mathbf{x}]$ stationary then

\[
\sum_{j=1}^{n} \int_a^b \left[ \frac{\partial F}{\partial x_j} - \frac{d}{dt}\left( \frac{\partial F}{\partial \dot{x}_j} \right) \right] h_j\,dt = 0,
\]

for all admissible variations $\mathbf{h}$. By allowing each of the $\{x_j\}$ to vary separately (while the others remain
constant), the ‘bump function’ argument can be applied exactly as before to show that

\[
\frac{\partial F}{\partial x_j}(\mathbf{x}^*, \dot{\mathbf{x}}^*, t) - \frac{d}{dt}\left( \frac{\partial F}{\partial \dot{x}_j}(\mathbf{x}^*, \dot{\mathbf{x}}^*, t) \right) = 0 \qquad (1 \le j \le n).
\]

This is the same as saying that $\mathbf{x}^*$ must satisfy the simultaneous Euler-Lagrange equations

\[
\frac{\partial F}{\partial x_j} - \frac{d}{dt}\left( \frac{\partial F}{\partial \dot{x}_j} \right) = 0 \qquad (1 \le j \le n).
\]

Our result is summarised as follows:

Euler-Lagrange equations with many variables

The vector function $\mathbf{x}^*$ makes the integral functional

\[ J[\mathbf{x}] = \int_a^b F(\mathbf{x}, \dot{\mathbf{x}}, t)\,dt \]

stationary if, and only if, $\mathbf{x}^*$ satisfies the simultaneous Euler-Lagrange differential
equations

\[
\frac{d}{dt}\left( \frac{\partial F}{\partial \dot{x}_j} \right) - \frac{\partial F}{\partial x_j} = 0 \qquad (1 \le j \le n).
\]

This is a natural generalisation of the single variable theory and corresponds to the
elementary result that a function of $n$ variables $f(x_1, x_2, \ldots, x_n)$ has a stationary point if,
and only if, all its first partial derivatives vanish at that point.

The statement of Hamilton’s principle for systems with many degrees of freedom is 
therefore: 


Hamilton’s principle 

The trajectory $\mathbf{q}^*(t)$ is an actual motion of a mechanical system if, and only if, $\mathbf{q}^*$
makes the action functional of the system stationary.


The only essential difference between this correct version of Hamilton’s principle and
the original version (quoted at the beginning of the chapter) is that an actual motion of the
system does not necessarily make the action functional a minimum, but it always makes
the action functional stationary.* In practice, the action functional is usually minimised,
but there are exceptional cases where it is not (see Problem 13.11).

As with Fermat’s principle, there is a classy version of Hamilton’s principle, which is 
less wordy and more satisfactory generally. It makes use of the concept of the extremals 
of J , which are simply solutions of the n simultaneous Euler-Lagrange equations. 


Hamilton’s principle - Classy version 

The actual motions of a mechanical system are the same as the extremals of its action 
functional. 


* Incorrect statements about Hamilton’s principle also abound in the literature. It is often claimed that ‘an
actual motion of the system makes the action functional a minimum or (occasionally) a maximum’. This
is untrue. It is not possible to make $S$ a maximum. The actual motion of the system usually makes $S$ a
minimum, occasionally it provides neither a minimum nor a maximum, but it never provides a maximum.

Significance of Hamilton’s principle

Since Hamilton’s principle is equivalent to Lagrange’s equations, it can be regarded as the
fundamental postulate of classical mechanics, instead of Newton’s laws,* for any
mechanical system that has a Lagrangian. It should be emphasised that this is not a new theory
- the Newtonian theory is correct - but an alternative route to the same results. Thus
we can derive Lagrange’s equations of motion from the Newtonian theory (as we did) or,
more directly, from Hamilton’s principle. Because Hamilton’s principle can be extended
to apply to a wide range of physical phenomena while the Newtonian theory cannot,
Hamilton’s principle is regarded as the more fundamental.

The problem with taking Hamilton’s principle as the fundamental postulate of
classical mechanics is that, had one not been exposed to the traditional treatment, one would
have no idea what the Lagrangian ought to be for any particular system. To convince
oneself of the difficulties involved, it is instructive to read Landau’s [6] ‘derivation’ of the
Lagrangian for the simplest system imaginable - a single particle moving in free space.
Indeed, it seems difficult to introduce the concept of mass convincingly at all.
Nevertheless, this is the route that must be followed when Hamilton’s principle is extended, for
example, to particle physics. The Lagrangian has to be found by intelligent guesswork,
and, in particular, by taking account of all the symmetries that are known to exist.

Within classical mechanics itself, it may appear that Hamilton’s principle has told us
nothing new. It says that the motions of a mechanical system are the same as the extremals
of the action functional, that is, the motions satisfy Lagrange’s equations; this we already
knew. However, because the equations of motion have the special form associated with
variational principles, they can be shown to possess important properties that would be
very difficult to prove directly. One example of this is the effect on the equations of
motion of choosing a new set of generalised coordinates $\mathbf{q}' = (q_1', q_2', \ldots, q_n')$. The $\mathbf{q}'$ are
known functions of the old generalised coordinates $\mathbf{q}$ and vice versa. The direct approach
would be to subject the Lagrange equations (13.27) to this general transformation of the
coordinates and see what happens; the result would be a complicated mess. However, in
the variational approach, one simply expresses the Lagrangian $L$ as a function of the new
variables, that is, $L = L(\mathbf{q}', \dot{\mathbf{q}}', t)$. Although $L$ has a different functional form in terms of
the coordinates $\mathbf{q}'$, its values are the same as before, so that the new action functional

\[ S[\mathbf{q}'] = \int_{t_0}^{t_1} L(\mathbf{q}', \dot{\mathbf{q}}', t)\,dt, \]

takes the same values as the old, provided that $\mathbf{q}'(t)$ and $\mathbf{q}(t)$ refer to the same trajectory of
the mechanical system. It follows that, if the trajectory $\mathbf{q}(t)$ makes $S[\mathbf{q}]$ stationary, then
the corresponding trajectory $\mathbf{q}'(t)$ makes $S[\mathbf{q}']$ stationary. Hence the extremals of $S[\mathbf{q}]$
map into the extremals of $S[\mathbf{q}']$, and vice versa. It follows that the transformed equations
of motion are just the same as the old ones with $\mathbf{q}$ replaced by $\mathbf{q}'$. This fact is expressed
by saying that the Lagrange equations of motion are invariant under transformations
of the generalised coordinates. This remarkable result clearly applies to any system of
equations derived from a variational principle. This provides a general way of ensuring
that any proposed set of governing equations should be invariant under a particular group
of transformations (the Lorentz transformations, for instance). This will be so if the
equations are derivable from a variational principle whose ‘Lagrangian’ is invariant under the
same group of transformations.

* Actually, instead of the Second and Third Laws. The First Law is needed to ensure that the motion is
observed from an inertial reference frame.


Problems on Chapter 13 


Answers and comments are at the end of the book.
Harder problems carry a star (*). 


Euler-Lagrange equation 


13.1 Find the extremal of the functional 



that satisfies $x(1) = 3$ and $x(2) = 18$. Show that this extremal provides the global minimum
of $J$.

13.2 Find the extremal of the functional 



that satisfies $x(0) = x(\pi) = 0$. Show that this extremal provides the global maximum of $J$.

13.3 Find the extremal of the path length functional 



that satisfies y(0) = y(l) = 0 and show that it does provide the global minimum for L. 

13.4 An aircraft flies in the $(x, z)$-plane from the point $(-a, 0)$ to the point $(a, 0)$. ($z = 0$ is
ground level and the $z$-axis points vertically upwards.) The cost of flying the aircraft at height
$z$ is $\exp(-kz)$ per unit distance of flight, where $k$ is a positive constant. Find the extremal for
the problem of minimising the total cost of the journey. [Assume that $ka < \pi/2$.]

13.5* Geodesics on a cone Solve the problem of finding a shortest path over the surface
of a cone of semi-angle $\alpha$ by the calculus of variations. Take the equation of the path in the
form $\rho = \rho(\theta)$, where $\rho$ is distance from the vertex $O$ and $\theta$ is the cylindrical polar angle
measured around the axis of the cone. Obtain the general expression for the path length and
find the extremal that satisfies the end conditions $\rho(-\pi/2) = \rho(\pi/2) = a$.

Verify that this extremal is the same as the shortest path that would be obtained by
developing the cone on to a plane.

13.6 Cost functional A manufacturer wishes to minimise the cost functional

\[ C[x] = \int_0^4 \left( (3 + \dot{x})\dot{x} + 2x \right) dt \]

subject to the conditions $x(0) = 0$ and $x(4) = X$, where $X$ is the volume of goods to be produced.
Find the extremal of $C$ that satisfies the given conditions and prove that this function provides
the global minimum of $C$.

Why is this solution not applicable when $X < 8$?


13.7 Soap film problem Consider the soap film problem for which it is required to minimise

\[ J[y] = \int_{-a}^{a} y\,(1 + \dot{y}^2)^{1/2}\,dx \]

with $y(-a) = y(a) = b$. Show that the extremals of $J$ have the form

\[ y = c \cosh\left( \frac{x}{c} + d \right), \]

where $c$, $d$ are constants, and that the end conditions are satisfied if (and only if) $d = 0$ and

\[ \cosh\lambda = \left( \frac{b}{a} \right) \lambda, \]

where $\lambda = a/c$. Show that there are two admissible extremals provided that the aspect ratio
$b/a$ exceeds a certain critical value and none if $b/a$ is less than this critical value. Sketch a
graph showing how this critical value is determined.

The remainder of this question requires computer assistance. Show that the critical value
of the aspect ratio $b/a$ is about 1.51. Choose a value of $b/a$ larger than the critical value
($b/a = 2$ is suitable) and find the two values of $\lambda$. Plot the two admissible extremals on the
same graph. Which one looks like the actual shape of the soap film? Check your guess by
perturbing each extremal by small admissible variations and finding the change in the value of
the functional $J[y]$.


Fermat’s principle 

13.8 A sugar solution has a refractive index $n$ that increases with the depth $z$ according to the
formula

\[ n = n_0 \left( 1 + \frac{z}{a} \right)^{1/2}, \]

where $n_0$ and $a$ are positive constants. A particular ray is horizontal when it passes through
the origin of coordinates. Show that the path of the ray is not the straight line $z = 0$ but the
parabola $z = x^2/4a$.

13.9 Consider the propagation of light rays in an axially symmetric medium, where, in a
system of cylindrical polar co-ordinates $(r, \theta, z)$, the refractive index $n = n(r)$ and the rays lie
in the plane $z = 0$. Show that Fermat’s time functional has the form

\[ T[r] = c^{-1} \int n\,(r^2 + \dot{r}^2)^{1/2}\,d\theta, \]

where $r = r(\theta)$ is the equation of the path, and $\dot{r}$ means $dr/d\theta$.
(i) Show that the extremals of $T$ satisfy the ODE

\[ \frac{n r^2}{(r^2 + \dot{r}^2)^{1/2}} = \text{constant}. \]

Show further that, if we write $\dot{r} = r \tan\psi$, where $\psi$ is the angle between the tangent to
the ray and the local cylindrical surface $r = \text{constant}$, this equation becomes

\[ r n \cos\psi = \text{constant}, \]

which is the form of Snell’s law for this case. Deduce that circular rays with centre at
the origin exist only when the refractive index $n = a/r$, where $a$ is a positive constant.

Hamilton’s principle 

13.10 A particle of mass 2 kg moves under uniform gravity along the $z$-axis, which points
vertically downwards. Show that (in SI units) the action functional for the time interval $[0, 2]$
is

\[ S[z] = \int_0^2 \left( \dot{z}^2 + 20z \right) dt, \]

where $g$ has been taken to be 10 m s$^{-2}$.

Show directly that, of all the functions $z(t)$ that satisfy the end conditions $z(0) = 0$ and
$z(2) = 20$, the actual motion $z = 5t^2$ provides the least value of $S$.

13.11 A certain oscillator with generalised coordinate $q$ has Lagrangian

\[ L = \dot{q}^2 - 4q^2. \]

Verify that $q^* = \sin 2t$ is a motion of the oscillator, and show directly that it makes the action
functional $S[q]$ stationary in any time interval $[0, \tau]$.

For the time interval $0 \le t \le \pi$, find the variation in the action functional corresponding
to the variations (i) $h = \epsilon \sin 4t$, (ii) $h = \epsilon \sin t$, where $\epsilon$ is a small parameter. Deduce that the
motion $q^* = \sin 2t$ does not make $S$ a minimum or a maximum.

13.12 A particle is constrained to move over a smooth fixed surface under no forces other than
the force of constraint. By using Hamilton’s principle and energy conservation, show that the
path of the particle must be a geodesic of the surface. (The term geodesic has been extended
here to mean those paths that make the length functional stationary.)

This result has a counterpart in the theory of general relativity, where the concept of force does not exist
and particles move along the geodesics of a curved space-time.

FIGURE 13.7 The path of quickest descent from P to Q in Cosine Valley. Those who lose their
nerve at the summit can walk down by the shortest path (shown dashed).

13.13 By using Hamilton’s principle, show that, if the Lagrangian $L(\mathbf{q}, \dot{\mathbf{q}}, t)$ is modified to
$L'$ by any transformation of the form

\[ L' = L + \frac{d}{dt}\,g(\mathbf{q}, t), \]

then the equations of motion are unchanged.

Computer assisted problems 

13.14 Geodesics on a paraboloid Solve the problem of finding the shortest path between
two points $P(0, 1, 1)$ and $Q(0, -1, 1)$ on the surface of the paraboloid $z = x^2 + y^2$.

Let $C$ be a path lying in the surface that connects $P$ and $Q$. Show that the length of $C$ is given by

\[ L[r] = \int_{-\pi/2}^{\pi/2} \left[ r^2 + (1 + 4r^2)\,\dot{r}^2 \right]^{1/2} d\theta, \]

where $r = r(\theta)$ is the polar equation of the projection of $C$ on to the plane $z = 0$, and $\dot{r}$ means $dr/d\theta$.
Now find the function $r(\theta)$ that minimises $L$. It is easier to work directly with the second order E-L
equation (which can be found with computer assistance). Solve the E-L equation numerically with the
initial conditions $r(0) = \lambda$, $\dot{r}(0) = 0$ and choose $\lambda$ so that the path passes through $P$ (and, by symmetry,
$Q$). Plot the shortest path using 3D graphics.

Hamilton’s equations and phase space


KEY FEATURES 

The key features of this chapter are the equivalence of Lagrange’s equations and Hamilton’s 
equations, Hamiltonian phase space, Liouville’s theorem and recurrence. 


In this chapter we show how Lagrange’s equations can be reformulated as a set of first
order differential equations known as Hamilton’s equations. Nothing new is added to the
physics and the Hamilton formulation is not superior to that of Lagrange when it comes
to problem solving. The value of Hamilton’s supremely elegant formulation lies in
providing a foundation for theoretical extensions both within and outside classical
mechanics. Within classical mechanics, it is the basis for most further developments such as the
Hamilton-Jacobi theory and chaos. Elsewhere, Hamiltonian mechanics provides the best
route to statistical mechanics, and the notion of the Hamiltonian is at the heart of quantum
mechanics. As applications of the Hamiltonian formulation, we prove Liouville’s theorem
and Poincaré’s recurrence theorem and explore some of their remarkable consequences.


14.1 SYSTEMS OF FIRST ORDER ODES 


The standard form for a system of first order ODEs in the $n$ unknown functions
$x_1(t), x_2(t), \ldots, x_n(t)$ is

\[
\begin{aligned}
\dot{x}_1 &= F_1(x_1, x_2, \ldots, x_n, t), \\
\dot{x}_2 &= F_2(x_1, x_2, \ldots, x_n, t), \\
&\;\;\vdots \\
\dot{x}_n &= F_n(x_1, x_2, \ldots, x_n, t),
\end{aligned}
\tag{14.1}
\]

where $F_1, F_2, \ldots, F_n$ are given functions of the variables $x_1, x_2, \ldots, x_n, t$. This can also
be written in the compact vector form

\[ \dot{\mathbf{x}} = \mathbf{F}(\mathbf{x}, t), \tag{14.2} \]

where $\mathbf{x}$ and $\mathbf{F}$ are the $n$-dimensional vectors $\mathbf{x} = (x_1, x_2, \ldots, x_n)$ and
$\mathbf{F} = (F_1, F_2, \ldots, F_n)$. If the value of $\mathbf{x}$ is given when $t = t_0$, the equations $\dot{\mathbf{x}} = \mathbf{F}(\mathbf{x}, t)$
determine the unknowns $\mathbf{x}(t)$ at all subsequent times.

A typical example is the predator-prey system of equations

\[
\begin{aligned}
\dot{x}_1 &= a x_1 - b x_1 x_2, \\
\dot{x}_2 &= b x_1 x_2 - c x_2,
\end{aligned}
\]

which govern the population density $x_1(t)$ of the prey and the population density $x_2(t)$ of
the predator. In this case, $F_1 = a x_1 - b x_1 x_2$, and $F_2 = b x_1 x_2 - c x_2$.

Converting higher order equations to first order 

Higher order ODEs can always be converted into equivalent systems of first order ODEs.
For example, consider the damped oscillator equation

\[ \ddot{x} + 3\dot{x} + 4x = 0. \tag{14.3} \]

If we introduce the new variable $v$, defined by $v = \dot{x}$, then the second order equation
(14.3) for $x(t)$ can be converted into the pair of first order equations

\[ \dot{x} = v, \qquad \dot{v} = -3v - 4x, \]

in the unknowns $\{x(t), v(t)\}$. Since this step is reversible, this pair of first order equations
is equivalent to the original second order equation (14.3). More generally:

Any system of n second order ODEs in n unknowns can be converted into an equivalent
system of 2n first order ODEs in 2n unknowns.
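
One practical reason for the standard form $\dot{\mathbf{x}} = \mathbf{F}(\mathbf{x}, t)$ is that general-purpose numerical integrators expect it. The sketch below applies a classical fourth order Runge-Kutta step to the damped oscillator, reading equation (14.3) as $\ddot{x} + 3\dot{x} + 4x = 0$ so that $\dot{x} = v$ and $\dot{v} = -3v - 4x$. The integrator, the initial conditions, and the function names are illustrative choices, not part of the text.

```python
import math

def rk4_step(F, x, t, dt):
    """One classical Runge-Kutta step for the first order system x' = F(x, t)."""
    k1 = F(x, t)
    k2 = F([xi + 0.5 * dt * ki for xi, ki in zip(x, k1)], t + 0.5 * dt)
    k3 = F([xi + 0.5 * dt * ki for xi, ki in zip(x, k2)], t + 0.5 * dt)
    k4 = F([xi + dt * ki for xi, ki in zip(x, k3)], t + dt)
    return [xi + dt * (a + 2.0 * b + 2.0 * c + d) / 6.0
            for xi, a, b, c, d in zip(x, k1, k2, k3, k4)]

def damped_oscillator(state, t):
    """Right-hand side F(x, t) for the pair x' = v, v' = -3 v - 4 x."""
    x, v = state
    return [v, -3.0 * v - 4.0 * x]

def integrate(F, x0, t_end, n_steps):
    """Advance x' = F(x, t) from t = 0 to t = t_end in n_steps equal steps."""
    dt = t_end / n_steps
    x, t = list(x0), 0.0
    for _ in range(n_steps):
        x = rk4_step(F, x, t, dt)
        t += dt
    return x

def exact_x(t):
    """Closed-form solution of the second order equation with x(0) = 1, x'(0) = 0."""
    omega = math.sqrt(7.0) / 2.0
    return math.exp(-1.5 * t) * (math.cos(omega * t) + (1.5 / omega) * math.sin(omega * t))

x_num, v_num = integrate(damped_oscillator, [1.0, 0.0], 2.0, 2000)
print(x_num, exact_x(2.0))
```

The first order system and the original second order equation should give the same $x(t)$, which is what the comparison with the closed-form solution checks.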

Consider, for example, the orbit equations for a particle of mass m attracted by the grav¬ 
itation of a mass M fixed at the origin O. In terms of polar coordinates centred on O, the 
Lagrangian is 

L = \m (r 2 +r 2 0 2 ) + —-—. 
and the corresponding Lagrange equations are 

.9 MG • 9 

r — rd 2 — - t— , 2rr0 + r 2 0 — 0, 

r z 

a pair of second order ODEs in the unknowns {r(t), 0(t)}. If we now introduce the new 
variables v r and vg, defined by 


v r = r , v 0 = 0, 

the two second order Lagrange equations can be converted into 

r = v r , i) r = rVg — MG /r 2 , 

0 = Vg, Vg = -2 V r Vg/r, 

an equivalent system of four first order ODEs in the four unknowns {/•, 0, v r , vg}. 
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Hamilton form 

In the above examples, we have performed the conversion to first order equations by intro¬ 
ducing the coordinate velocities as new variables. This is not the only choice however. 
When transforming a system of Lagrange equations to a first order system, we could 
instead take the congugate momenta, defined by 


Pi = 


dL 

Wi' 


(14.4) 


as the new variables. This seems quite attractive since the Lagrange equations already 
have the form 


dL 

Pi = 7— (!<;'< n). 

dqj 

The downside is that the right sides 9 L /dqj are functions of q , q, t, and must be trans¬ 
formed to functions of q, p,t. This is achieved by inverting the equations (14.4) to express 
the q as functions of q, p, t. 

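For the Lagrangian in the orbit problem, the conjugate momenta are

\[ p_r = \frac{\partial L}{\partial \dot{r}} = m\dot{r}, \qquad p_\theta = \frac{\partial L}{\partial \dot{\theta}} = m r^2 \dot{\theta}, \]

and these equations are easily inverted* to give

\[ \dot{r} = \frac{p_r}{m}, \qquad \dot{\theta} = \frac{p_\theta}{m r^2}. \]

The two second order Lagrange equations for the orbit problem are therefore equivalent
to the system of four first order ODEs

\[ \dot{r} = \frac{p_r}{m}, \qquad \dot{p}_r = \frac{p_\theta^2}{m r^3} - \frac{mMG}{r^2}, \]
\[ \dot{\theta} = \frac{p_\theta}{m r^2}, \qquad \dot{p}_\theta = 0, \]

in the four unknowns $\{r, \theta, p_r, p_\theta\}$. This is the Hamilton form of Lagrange’s equations
for the orbit problem.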
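
The Hamilton form of the orbit equations can be explored numerically. The sketch below is illustrative only: the units (m = 1, MG = 1), the initial state, the step size and the names `orbit_rhs`, `energy` and `rk4_step` are all assumptions, and the Runge-Kutta stepper is simply a convenient choice of integrator. It advances the four first order equations and monitors two quantities that should not change: the momentum $p_\theta$ (whose time derivative is zero) and the total energy T + V.

```python
def orbit_rhs(state, m=1.0, MG=1.0):
    """Right sides of the Hamilton form of the orbit equations."""
    r, theta, p_r, p_theta = state
    return [
        p_r / m,                                     # r'
        p_theta / (m * r**2),                        # theta'
        p_theta**2 / (m * r**3) - m * MG / r**2,     # p_r'
        0.0,                                         # p_theta'
    ]

def energy(state, m=1.0, MG=1.0):
    """Total energy T + V written in terms of r, p_r, p_theta."""
    r, _, p_r, p_theta = state
    return p_r**2 / (2 * m) + p_theta**2 / (2 * m * r**2) - m * MG / r

def rk4_step(f, y, h):
    """One classical Runge-Kutta step of size h for y' = f(y)."""
    k1 = f(y)
    k2 = f([yi + 0.5 * h * ki for yi, ki in zip(y, k1)])
    k3 = f([yi + 0.5 * h * ki for yi, ki in zip(y, k2)])
    k4 = f([yi + h * ki for yi, ki in zip(y, k3)])
    return [yi + h * (a + 2 * b + 2 * c + d) / 6.0
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]

# Assumed initial state (r, theta, p_r, p_theta) for a bound orbit.
state = [1.0, 0.0, 0.2, 1.1]
e_start = energy(state)
for _ in range(5000):            # integrate to t = 5 with h = 0.001
    state = rk4_step(orbit_rhs, state, 0.001)
e_end = energy(state)
print(state[3], e_end - e_start)
```

Because the right side of the $p_\theta$ equation is identically zero, the integrator leaves $p_\theta$ untouched, while the energy drifts only by the small truncation error of the scheme.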


* This step is difficult in the general case where there are $n$ coupled linear equations to solve for $\dot{q}_1, \ldots, \dot{q}_n$.


Why bother?

The reader is probably wondering what is the point of converting Lagrange’s equations
into a system of first order ODEs. For the purpose of finding solutions to particular
problems, like the orbit problem above, there is no point. Indeed, the new system of first order
equations may be harder to solve than the original second order equations. The real interest
lies in the structure of the general theory. When Lagrange’s equations are expressed in
Hamilton form* in a general manner, the result is a system of first order equations of
great simplicity and elegance, now known as Hamilton’s equations. These equations are
the foundation of further developments in analytical mechanics, such as the
Hamilton-Jacobi theory and chaos. Also, the Hamiltonian function, which appears in Hamilton’s
equations, is at the heart of quantum mechanics.


14.2 LEGENDRE TRANSFORMS 


The general problem of converting Lagrange’s equations into Hamilton form
hinges on the inversion of the equations that define $p$, namely,

\[ p_j = \frac{\partial L(q, \dot{q}, t)}{\partial \dot{q}_j} \qquad (1 \le j \le n), \tag{14.5} \]

so as to express $\dot{q}$ in terms of $q$, $p$, $t$. This inversion is made easier by the fact that
the $\{p_j\}$ are not general functions of $q$, $\dot{q}$ and $t$, but are the first partial derivatives of a
scalar function, the Lagrangian $L(q, \dot{q}, t)$. It is a remarkable consequence that the inverse
formulae can be written in a similar way.† The details of the argument follow below and
the results are summarised at the end of the section.


The two-variable case 

We develop the transformation theory for the case of functions of two variables. This has
all the important features of the general case but is much easier to follow. Suppose that $v_1$
and $v_2$ are defined as functions of the variables $u_1$ and $u_2$ by the formulae

\[ v_1 = \frac{\partial F}{\partial u_1}, \qquad v_2 = \frac{\partial F}{\partial u_2}, \tag{14.6} \]

where $F(u_1, u_2)$ is a given function of $u_1$ and $u_2$. Is it possible to write the inverse
formulae‡ in the form

\[ u_1 = \frac{\partial G}{\partial v_1}, \qquad u_2 = \frac{\partial G}{\partial v_2}, \tag{14.7} \]


* This is the form in which the new variables are the conjugate momenta $p_1, \ldots, p_n$. The form in which the
new variables are the generalised velocities $\dot{q}_1, \ldots, \dot{q}_n$ does not lead to an elegant theory, and is therefore
not used. It is often claimed that it is not possible to take the generalised velocities as new variables
because ‘they are the time derivatives of the generalised coordinates and therefore cannot be independent
variables’. This objection is baseless, as the previous examples show. Indeed, if this objection had any
substance, the conjugate momenta would be disqualified as well!

† There is a neat way of seeing that this must be true, which may appeal to mathematicians. If $v = \operatorname{grad}_u F(u)$,
then the Jacobian matrix of the transformation from $u$ to $v$ is symmetric. It follows that the
Jacobian matrix of the inverse transformation must also be symmetric, which is precisely the condition
that the inverse transformation has the form $u = \operatorname{grad}_v G(v)$.

‡ We will always suppose that the inverse transformation does exist.
for some function $G(v_1, v_2)$? In the simplest cases one can answer the question by
grinding through the details directly. For example, suppose $F = 2u_1^2 + 3u_1u_2 + u_2^2$. Then

\[ v_1 = 4u_1 + 3u_2, \]
\[ v_2 = 3u_1 + 2u_2. \]

The inverse formulae are easily obtained by solving these equations for $u_1$, $u_2$, which
gives

\[ u_1 = -2v_1 + 3v_2, \]
\[ u_2 = 3v_1 - 4v_2. \]

There is no prior reason to expect that these formulae for $u_1$, $u_2$ can be expressed in
terms of a single function $G(v_1, v_2)$ in the form (14.7), but it is true because the right
sides of these equations happen to satisfy the necessary consistency condition.* Simple
integration then shows that (to within a constant)

\[ G = -v_1^2 + 3v_1v_2 - 2v_2^2. \]

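This result is not a coincidence. Let $F(u_1, u_2)$ now be any function of the variables $u_1$,
$u_2$, and suppose that a function $G(v_1, v_2)$ satisfying equation (14.7) does exist. Consider
the expression

\[ X = F(u_1, u_2) + G(v_1, v_2) - (u_1 v_1 + u_2 v_2), \]

which, as it stands, is a function of the four independent variables $u_1$, $u_2$, $v_1$, $v_2$. Suppose
now that, in this formula, we imagine† that $v_1$ and $v_2$ are replaced by their expressions in
terms of $u_1$ and $u_2$. Then $X$ becomes a function of the variables $u_1$ and $u_2$ only. Its partial
derivative with respect to $u_1$, holding $u_2$ constant, is then given by

\[ \frac{\partial X}{\partial u_1} = \left( \frac{\partial F}{\partial u_1} - v_1 \right) + \left( \frac{\partial G}{\partial v_1} - u_1 \right) \frac{\partial v_1}{\partial u_1} + \left( \frac{\partial G}{\partial v_2} - u_2 \right) \frac{\partial v_2}{\partial u_1} = 0 + 0 + 0 = 0, \]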


* If $u_1 = f_1(v_1, v_2)$ and $u_2 = f_2(v_1, v_2)$ then it is possible to express $u_1$, $u_2$ in the form (14.7) only if the
functions $f_1$ and $f_2$ are related by the formula

\[ \frac{\partial f_1}{\partial v_2} = \frac{\partial f_2}{\partial v_1}, \]

which is called the consistency condition.

† Pure mathematicians strongly object to such feats of imagination. Unfortunately, the alternative is to
introduce a welter of functional notation which obscures the essential simplicity of the argument. We
will make frequent use of such ‘imagined’ substitutions.
Active and passive variables

The variables $u = (u_1, u_2)$ and $v = (v_1, v_2)$ are called active variables because they are
the ones that are actually transformed. However, the functions $F$ and $G$ may also depend
on additional variables that are not part of the transformation as such, but have the status
of parameters. These are called passive variables. In the dynamical problem, $\dot{q}$ and $p$ are
the active variables and $q$ is the passive variable. We need to find how partial derivatives
of $F$ and $G$ with respect to the passive variables are related.

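Suppose then that $F = F(u_1, u_2, w)$ and $G = G(v_1, v_2, w)$ satisfy the formulae
(14.6) and (14.7), where $w$ is a passive variable. Then (14.6) defines $v_1$, $v_2$ as functions
of $u_1$, $u_2$ and $w$, and (14.7) defines $u_1$, $u_2$ as functions of $v_1$, $v_2$ and $w$. The argument
leading to the formula (14.8) still holds so that

\[ F(u_1, u_2, w) + G(v_1, v_2, w) = u_1 v_1 + u_2 v_2. \tag{14.10} \]

In this formula, imagine that $v_1$ and $v_2$ are replaced by their expressions in terms of $u_1$,
$u_2$ and $w$, then differentiate the resulting identity with respect to $w$, holding $u_1$ and $u_2$
constant. On using the chain rule, this gives

\[ \frac{\partial F}{\partial w} + \left( \frac{\partial G}{\partial v_1}\frac{\partial v_1}{\partial w} + \frac{\partial G}{\partial v_2}\frac{\partial v_2}{\partial w} + \frac{\partial G}{\partial w} \right) = u_1 \frac{\partial v_1}{\partial w} + u_2 \frac{\partial v_2}{\partial w}, \]

which can be written

\[ \frac{\partial F}{\partial w} + \frac{\partial G}{\partial w} = \left( u_1 - \frac{\partial G}{\partial v_1} \right) \frac{\partial v_1}{\partial w} + \left( u_2 - \frac{\partial G}{\partial v_2} \right) \frac{\partial v_2}{\partial w} = 0 + 0 = 0, \]

on using the relations (14.7). Hence the partial derivatives of $F(u_1, u_2, w)$ and
$G(v_1, v_2, w)$ with respect to $w$ are related by

\[ \frac{\partial F}{\partial w} = -\frac{\partial G}{\partial w}. \tag{14.11} \]

This is the required result; it holds for each passive variable $w$.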
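
The relations (14.7), (14.10) and (14.11) can be checked numerically on a small example. The sketch below is an illustration under stated assumptions: it takes the quadratic function used earlier in this section and adds a hypothetical passive term $w u_1$ (this extension, the sample point, and the helper names `u_from_v` and `central_diff` are not from the text). $G$ is generated from formula (14.10), and its partial derivatives are estimated by central differences.

```python
def F(u1, u2, w):
    """Assumed example: the quadratic of this section plus a passive term w*u1."""
    return 2 * u1**2 + 3 * u1 * u2 + u2**2 + w * u1

def u_from_v(v1, v2, w):
    """Invert v1 = 4u1 + 3u2 + w, v2 = 3u1 + 2u2 for (u1, u2)."""
    s = v1 - w
    return (-2 * s + 3 * v2, 3 * s - 4 * v2)

def G(v1, v2, w):
    """Legendre transform generated by formula (14.10): G = u.v - F."""
    u1, u2 = u_from_v(v1, v2, w)
    return u1 * v1 + u2 * v2 - F(u1, u2, w)

def central_diff(f, args, i, h=1e-5):
    """Central-difference estimate of the partial derivative of f in argument i."""
    up, dn = list(args), list(args)
    up[i] += h
    dn[i] -= h
    return (f(*up) - f(*dn)) / (2 * h)

v1, v2, w = 0.7, -1.3, 0.4                      # arbitrary sample point
u1, u2 = u_from_v(v1, v2, w)

dG_dv1 = central_diff(G, (v1, v2, w), 0)        # should reproduce u1, by (14.7)
dG_dv2 = central_diff(G, (v1, v2, w), 1)        # should reproduce u2, by (14.7)
dG_dw = central_diff(G, (v1, v2, w), 2)         # derivative at fixed v1, v2
dF_dw = central_diff(F, (u1, u2, w), 2)         # derivative at fixed u1, u2
print(dG_dv1 - u1, dG_dv2 - u2, dF_dw + dG_dw)
```

Setting $w = 0$ in this sketch recovers the function $G = -v_1^2 + 3v_1v_2 - 2v_2^2$ found by direct integration above.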

The general case with many variables 

The preceding theory can be extended to any number of variables. The results are exactly 
what one would expect and are summarised in the box below. This summary is presented 
in a compact vector form using the m- dimensional ‘grad’.* It is a good idea to write these 
results out in expanded form. 


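* If $F = F(u, w)$, where $u = (u_1, u_2, \ldots, u_n)$ and $w = (w_1, w_2, \ldots, w_m)$, then $\operatorname{grad}_u F$ and $\operatorname{grad}_w F$
mean

\[ \operatorname{grad}_u F(u, w) = \left( \frac{\partial F}{\partial u_1}, \frac{\partial F}{\partial u_2}, \ldots, \frac{\partial F}{\partial u_n} \right), \qquad \operatorname{grad}_w F(u, w) = \left( \frac{\partial F}{\partial w_1}, \frac{\partial F}{\partial w_2}, \ldots, \frac{\partial F}{\partial w_m} \right). \]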
which can be written in the vector form

\[ p = \operatorname{grad}_{\dot{q}} L(q, \dot{q}, t). \tag{14.18} \]

The first step is to eliminate the coordinate velocities $\dot{q}$ from the Lagrange equations
in favour of the momenta $p$. This in turn requires that the formula (14.18) must be inverted
so as to express $\dot{q}$ in terms of $q$, $p$ and $t$. This is precisely what Legendre transforms do.
It follows from the theory of the last section that the inverse formula to (14.18) can be
written in the form

\[ \dot{q} = \operatorname{grad}_p H(q, p, t), \tag{14.19} \]

where the function $H(q, p, t)$ is the Legendre transform of the Lagrangian function
$L(q, \dot{q}, t)$. Here, $\dot{q}$ and $p$ are the active variables and $q$ is the passive variable.

Definition 14.1 Hamiltonian function The function $H(q, p, t)$, which is the Legendre
transform of the Lagrangian function $L(q, \dot{q}, t)$, is called the Hamiltonian function of
the system S.

Since the functions $H$ and $L$ are Legendre transforms of each other, they satisfy the
relations

\[ H(q, p, t) = \dot{q} \cdot p - L(q, \dot{q}, t), \tag{14.20} \]

which can be used to generate $H$ from $L$, and

\[ \operatorname{grad}_q L(q, \dot{q}, t) = -\operatorname{grad}_q H(q, p, t), \tag{14.21} \]

which connects the derivatives of $L$ and $H$ with respect to the passive variables.

It is now quite easy to perform the transformation of Lagrange’s equations. The
Lagrange equations (14.16) can be written in terms of the generalised momenta $\{p_j\}$ in
the form

\[ \dot{p}_j = \frac{\partial L}{\partial q_j} \qquad (1 \le j \le n), \]

which is equivalent to the vector form

\[ \dot{p} = \operatorname{grad}_q L(q, \dot{q}, t). \tag{14.22} \]

The right sides of these equations still involve $\dot{q}$, but, on using the formula (14.21), we
obtain

\[ \dot{p} = -\operatorname{grad}_q H(q, p, t). \tag{14.23} \]

These are the transformed Lagrange equations! The Hamilton form of the Lagrange
equations therefore consists of equations (14.23) together with equations (14.19), which
effectively define the generalised momentum $p$.
All of the above argument is reversible and so the Hamilton form and the Lagrange
form are equivalent. Our results are summarised below:


Hamilton’s equations

The $n$ Lagrange equations (14.16) are equivalent to the system of $2n$ first order
ODEs

\[ \dot{q} = \operatorname{grad}_p H(q, p, t), \qquad \dot{p} = -\operatorname{grad}_q H(q, p, t), \tag{14.24} \]

where the Hamiltonian function $H(q, p, t)$ is the Legendre transform of the
Lagrangian $L(q, \dot{q}, t)$ and is generated by the formula (14.20). This is the vector
form of Hamilton’s equations.* The expanded form is

\[ \dot{q}_j = \frac{\partial H}{\partial p_j}, \qquad \dot{p}_j = -\frac{\partial H}{\partial q_j} \qquad (1 \le j \le n). \]


We have shown that the $n$ second order Lagrange equations in the $n$ unknowns $q(t)$
are mathematically equivalent to the $2n$ first order Hamilton equations in the $2n$ unknowns
$q(t)$, $p(t)$. In each of these formulations of mechanics, the motion of the system is
determined by the form of a single function, the Lagrangian $L(q, \dot{q}, t)$ in the Lagrange
formulation, and the Hamiltonian $H(q, p, t)$ in the Hamilton formulation. Hamilton’s
equations are a particularly elegant first order system in which the functions $F_1, F_2, \ldots$ that
appear on the right are simply the first partial derivatives of a single function, the
Hamiltonian $H$. Moreover these right hand sides also satisfy the special condition† $\operatorname{div} F = 0$,
which allows Liouville’s theorem to be applied to Hamiltonian mechanics.‡
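
The contrast between the two first order forms can be made concrete for the orbit problem treated earlier. The sketch below is illustrative only: the sample point, the units m = 1, MG = 1 and the function names are assumptions. It estimates the divergence of the right sides by central differences, once for the form that uses the coordinate velocities $(v_r, v_\theta)$ and once for the Hamilton form that uses $(p_r, p_\theta)$.

```python
def divergence(field, x, h=1e-6):
    """Central-difference estimate of sum_i dF_i/dx_i at the point x."""
    total = 0.0
    for i in range(len(x)):
        up, dn = list(x), list(x)
        up[i] += h
        dn[i] -= h
        total += (field(up)[i] - field(dn)[i]) / (2 * h)
    return total

def velocity_form(x, MG=1.0):
    """Orbit equations with the coordinate velocities as new variables."""
    r, theta, v_r, v_theta = x
    return (v_r, v_theta, r * v_theta**2 - MG / r**2, -2 * v_r * v_theta / r)

def hamilton_form(x, m=1.0, MG=1.0):
    """Orbit equations with the conjugate momenta as new variables."""
    r, theta, p_r, p_theta = x
    return (p_r / m, p_theta / (m * r**2),
            p_theta**2 / (m * r**3) - m * MG / r**2, 0.0)

point = [1.5, 0.3, 0.2, 0.8]                       # arbitrary sample point
div_velocity = divergence(velocity_form, point)    # analytically -2 v_r / r
div_hamilton = divergence(hamilton_form, point)    # analytically 0
print(div_velocity, div_hamilton)
```

In the velocity form the divergence works out to $-2v_r/r$, which is not zero in general, whereas in the Hamilton form it vanishes at every point.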

Explicit time dependence 

One final note. When the Lagrangian has an explicit time dependence, this $t$ has the status
of an extra passive variable. It follows that we then have the additional relation

\[ \frac{\partial L}{\partial t} = -\frac{\partial H}{\partial t}, \tag{14.25} \]

which shows that if either of $L$ or $H$ has an explicit time dependence, then so does the
other.


* After Sir William Rowan Hamilton, whose paper Second Essay on a General Method in Dynamics was
published in 1835. Hamilton’s equations are sometimes called the canonical equations; no one seems to
know the reason why.

† The scalar quantity $\operatorname{div} F$ is defined by

\[ \operatorname{div} F = \frac{\partial F_1}{\partial x_1} + \frac{\partial F_2}{\partial x_2} + \cdots + \frac{\partial F_n}{\partial x_n}. \]

‡ None of these statements is true if Lagrange’s equations are expressed as a first order system by taking
the coordinate velocities $\dot{q}$ as the new variables.


Example 14.2 Finding a Hamiltonian and Hamilton’s equations 


Find the Hamiltonian and Hamilton’s equations for the simple pendulum. 


Solution 

The Lagrangian for the simple pendulum is

\[ L = \tfrac{1}{2} m a^2 \dot{\theta}^2 + mga\cos\theta, \]

where $\theta$ is the angle between the string and the downward vertical, $m$ is the mass of
the bob, and $a$ is the string length. The momentum $p_\theta$ conjugate to the coordinate $\theta$
is given by

\[ p_\theta = \frac{\partial L}{\partial \dot{\theta}} = m a^2 \dot{\theta} \]

and this formula is easily inverted to give

\[ \dot{\theta} = \frac{p_\theta}{m a^2}. \tag{14.26} \]

The Hamiltonian $H$ is then given by

\[ H = \dot{\theta}\, p_\theta - L, \]

where $\dot{\theta}$ is given by equation (14.26). This gives

\[ H = \left( \frac{p_\theta}{m a^2} \right) p_\theta - \tfrac{1}{2} m a^2 \left( \frac{p_\theta}{m a^2} \right)^2 - mga\cos\theta \]
\[ \phantom{H} = \frac{p_\theta^2}{2 m a^2} - mga\cos\theta. \]

This is the Hamiltonian for the simple pendulum. From $H$ we can find Hamilton’s
equations. They are

\[ \dot{\theta} = \frac{\partial H}{\partial p_\theta} = \frac{p_\theta}{m a^2}, \qquad \dot{p}_\theta = -\frac{\partial H}{\partial \theta} = -mga\sin\theta. \]

These are Hamilton’s equations for the simple pendulum.

This simple example illustrates clearly why Lagrange’s equations are preferred
over Hamilton’s equations for the practical solution of problems. To solve Hamilton’s
equations in this case, we would differentiate the first equation with respect to $t$ and
then use the second equation to eliminate the unknown $p_\theta$. This gives

\[ \ddot{\theta} + \frac{g}{a}\sin\theta = 0, \]

which is precisely the Lagrange equation for the system! ■
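
Although Hamilton’s equations offer no analytical advantage here, their first order form is convenient for numerical work. The sketch below is one possible illustration, not a method given in the text: the parameter values (m = 1, a = 1, g = 9.81), the small initial angle, the Runge-Kutta stepper and all function names are assumptions. It integrates the two Hamilton equations over one small-amplitude period $2\pi\sqrt{a/g}$ and checks that $H$ stays constant.

```python
import math

M, A, G_ACC = 1.0, 1.0, 9.81      # assumed mass, string length, gravity

def pendulum_rhs(state):
    """Hamilton's equations for the simple pendulum."""
    theta, p_theta = state
    return [p_theta / (M * A**2), -M * G_ACC * A * math.sin(theta)]

def hamiltonian(state):
    """H = p_theta^2 / (2 m a^2) - m g a cos(theta)."""
    theta, p_theta = state
    return p_theta**2 / (2 * M * A**2) - M * G_ACC * A * math.cos(theta)

def rk4_step(f, y, h):
    """One classical Runge-Kutta step of size h for y' = f(y)."""
    k1 = f(y)
    k2 = f([yi + 0.5 * h * ki for yi, ki in zip(y, k1)])
    k3 = f([yi + 0.5 * h * ki for yi, ki in zip(y, k2)])
    k4 = f([yi + h * ki for yi, ki in zip(y, k3)])
    return [yi + h * (a + 2 * b + 2 * c + d) / 6.0
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]

period = 2 * math.pi * math.sqrt(A / G_ACC)   # small-amplitude period
n_steps = 2000
state = [0.01, 0.0]                           # released from rest at 0.01 rad
h_start = hamiltonian(state)
for _ in range(n_steps):
    state = rk4_step(pendulum_rhs, state, period / n_steps)
h_end = hamiltonian(state)
print(state[0], h_end - h_start)
```

For such a small amplitude the pendulum returns very nearly to its starting angle after one linearised period, and the computed value of $H$ changes only by the integrator’s truncation error.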
where $T(q, p)$ is the kinetic energy of the system expressed in terms of the variables $q$, $p$.
In this case, $H$ is simply the total energy of the system, expressed in terms of the variables
$q$, $p$. This is the quickest way of finding $H$ when the system is conservative.


Example 14.3 Finding a Hamiltonian 2 


Find the Hamiltonian for the inverse square orbit problem considered earlier and 
deduce Hamilton’s equations for this system. 

Solution 

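This is a conservative system so that $H = T + V$. With the polar coordinates $r$ and
$\theta$ as generalised coordinates, $T$ and $V$ are given by

\[ T = \tfrac{1}{2} m \left( \dot{r}^2 + r^2\dot{\theta}^2 \right), \qquad V = -\frac{mMG}{r}, \]

and the generalised momenta are given by

\[ p_r = \frac{\partial L}{\partial \dot{r}} = m\dot{r}, \qquad p_\theta = \frac{\partial L}{\partial \dot{\theta}} = m r^2 \dot{\theta}. \]

These equations are easily inverted to give

\[ \dot{r} = \frac{p_r}{m}, \qquad \dot{\theta} = \frac{p_\theta}{m r^2}, \]

so that the Hamiltonian is given by

\[ H = T + V = \tfrac{1}{2} m \left( \dot{r}^2 + r^2\dot{\theta}^2 \right) - \frac{mMG}{r} \]
\[ \phantom{H} = \frac{p_r^2}{2m} + \frac{p_\theta^2}{2 m r^2} - \frac{mMG}{r}. \]

This is the required Hamiltonian. Hamilton’s equations are now found by using this
Hamiltonian in the general equations (14.24). The partial derivatives of $H$ are

\[ \frac{\partial H}{\partial p_r} = \frac{p_r}{m}, \qquad \frac{\partial H}{\partial p_\theta} = \frac{p_\theta}{m r^2}, \qquad \frac{\partial H}{\partial r} = -\frac{p_\theta^2}{m r^3} + \frac{mMG}{r^2}, \qquad \frac{\partial H}{\partial \theta} = 0, \]

and Hamilton’s equations for the orbit problem are therefore

\[ \dot{r} = \frac{p_r}{m}, \qquad \dot{p}_r = \frac{p_\theta^2}{m r^3} - \frac{mMG}{r^2}, \]
\[ \dot{\theta} = \frac{p_\theta}{m r^2}, \qquad \dot{p}_\theta = 0. \]

Naturally, these are the same equations as were obtained earlier by ‘manual’
transformation of Lagrange’s equations. As in the last example, solution of the Hamilton
equations by eliminating the momenta simply leads back to Lagrange’s equations. ■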
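
The step from a Hamiltonian to Hamilton’s equations is mechanical enough to automate. The following sketch is an illustration under assumptions, not a procedure from the text: the helper `hamilton_rhs` (a hypothetical name) estimates $\partial H/\partial p_j$ and $-\partial H/\partial q_j$ by central differences for any Hamiltonian supplied as a Python function, and the result is compared with the analytic right sides found above for the orbit problem, in assumed units m = 1, MG = 1.

```python
def hamilton_rhs(H, q, p, h=1e-6):
    """Return (q_dot, p_dot) with q_dot_j = dH/dp_j and p_dot_j = -dH/dq_j."""
    q_dot, p_dot = [], []
    for j in range(len(q)):
        p_up, p_dn = list(p), list(p)
        p_up[j] += h
        p_dn[j] -= h
        q_dot.append((H(q, p_up) - H(q, p_dn)) / (2 * h))
        q_up, q_dn = list(q), list(q)
        q_up[j] += h
        q_dn[j] -= h
        p_dot.append(-(H(q_up, p) - H(q_dn, p)) / (2 * h))
    return q_dot, p_dot

def H_orbit(q, p, m=1.0, MG=1.0):
    """Hamiltonian of the inverse square orbit problem in polar coordinates."""
    r, theta = q
    p_r, p_theta = p
    return p_r**2 / (2 * m) + p_theta**2 / (2 * m * r**2) - m * MG / r

q, p = [1.5, 0.3], [0.2, 0.8]          # arbitrary sample point (r, theta), (p_r, p_theta)
q_dot, p_dot = hamilton_rhs(H_orbit, q, p)

# Analytic right sides from the worked example, with m = 1 and MG = 1.
r, p_r, p_theta = q[0], p[0], p[1]
expected_q_dot = [p_r, p_theta / r**2]
expected_p_dot = [p_theta**2 / r**3 - 1.0 / r**2, 0.0]
print(q_dot, p_dot)
```

A finite-difference gradient is used only to keep the sketch self-contained; in practice one would differentiate $H$ by hand, as in the example, or with a symbolic or automatic differentiation tool.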
Momentum conservation

From the Hamilton equation $\dot{p}_j = -\partial H/\partial q_j$, it follows that:

If $\partial H/\partial q_j = 0$ (that is, if the coordinate $q_j$ is absent from the Hamiltonian), then the
generalised momentum $p_j$ is constant in any motion.

The corresponding result in the Lagrangian formulation is that:

If $\partial L/\partial q_j = 0$ (that is, if the coordinate $q_j$ is absent from the Lagrangian), then the
generalised momentum $p_j$ is constant in any motion.

These two results seem slightly different, but they are equivalent, since, from equation
(14.21), $\partial H/\partial q_j = -\partial L/\partial q_j$. This means that the term cyclic coordinate, by which
we previously meant a coordinate that did not appear in the Lagrangian, can be applied
without ambiguity to mean that the coordinate does not appear in the Hamiltonian. Our
result is then:


Conservation of momentum

If $q_j$ is a cyclic coordinate (in the sense that it does not appear in the Hamiltonian),
then $p_j$, the generalised momentum conjugate to $q_j$, is constant in any motion.


14.4 HAMILTONIAN PHASE SPACE ((q, p)-space)

Suppose the mechanical system S has generalised coordinates $q$, conjugate
momenta $p$, and Hamiltonian $H(q, p, t)$. If the initial values of $q$ and $p$ are known,*
then the subsequent motion of S, described by the functions $\{q(t), p(t)\}$, is uniquely
determined by Hamilton’s equations. This motion can be represented geometrically by
the motion of a ‘point’ (called a phase point) in Hamiltonian phase space. Hamiltonian
phase space is a real space of $2n$ dimensions in which a ‘point’ is a set of values
$(q_1, q_2, \ldots, q_n, p_1, p_2, \ldots, p_n)$ of the independent variables $\{q, p\}$. (Note that a point in
Hamiltonian phase space represents not only the configuration of the system S but also its
instantaneous momenta. This is the distinction between Hamiltonian phase space, which
has $2n$ dimensions, and Lagrangian configuration space, which has $n$ dimensions.) Each
motion of the system S then corresponds to the motion of a phase point through the phase
space.†

The only case in which we can actually draw the phase space is when S has just one
degree of freedom. Then the phase space is two-dimensional and can be drawn on paper.


* We are more familiar with initial conditions in which $q$ and $\dot{q}$ are prescribed. However, these conditions
are equivalent to those in which the initial values of $q$ and $p$ are prescribed.

† It should be noted that Hamiltonian phase space is generally not the same as the phase space introduced
in Chapter 8, which, in the present notation, is $(q, \dot{q})$-space. In particular, our next result (Liouville’s
theorem) does not apply in $(q, \dot{q})$-space.
FIGURE 14.1 Typical paths in the phase space $(q, p)$ corresponding to
motions of a system S with Hamiltonian $H = p^2 + q^2/9$. The arrows
show the direction that the phase point moves along each path as $t$
increases.


Example 14.4 Paths in phase space 


Suppose that S has the single coordinate $q$ and Lagrangian

\[ L = \frac{\dot{q}^2}{4} - \frac{q^2}{9}. \]

Find the paths in Hamiltonian phase space that correspond to the motions of S.


Solution

The conjugate momentum $p = \partial L/\partial \dot{q} = \tfrac{1}{2}\dot{q}$, and the Hamiltonian is

\[ H = \dot{q}\,p - L = (2p)\,p - \tfrac{1}{4}(2p)^2 + \frac{q^2}{9} = p^2 + \frac{q^2}{9}. \]

Hamilton’s equations for S are therefore

\[ \dot{q} = 2p, \qquad \dot{p} = -2q/9. \]

On eliminating $p$, we find that $q$ satisfies the SHM equation

\[ \ddot{q} + (4q/9) = 0. \]

The general solution of the Hamilton equations for S is therefore

\[ q = 3A\cos((2t/3) + \alpha), \qquad p = -A\sin((2t/3) + \alpha), \]

where $A$ and $\alpha$ are arbitrary constants. These are the parametric equations of the
paths in phase space, the parameter being the time $t$; each path corresponds to a
possible motion of the system S. Some typical paths are shown in Figure 14.1. For
this system, every motion is periodic so that the paths are closed curves in the
$(q, p)$-plane. (They are actually concentric similar ellipses.) The arrows show the direction
that the phase point moves along each path as $t$ increases. ■
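
The claim that each path is an ellipse can be checked directly from the parametric solution. In the sketch below (an illustration only; the values of $A$ and $\alpha$, the sample times and the function names are assumptions) the Hamiltonian $H = p^2 + q^2/9$ is evaluated along one path. It should take the constant value $A^2$, and the phase point should return to its starting position after the period $3\pi$.

```python
import math

def phase_point(t, A, alpha):
    """Parametric solution q = 3A cos(2t/3 + alpha), p = -A sin(2t/3 + alpha)."""
    phase = 2.0 * t / 3.0 + alpha
    return 3.0 * A * math.cos(phase), -A * math.sin(phase)

def hamiltonian(q, p):
    """H = p^2 + q^2 / 9 for the system of this example."""
    return p**2 + q**2 / 9.0

A, alpha = 1.5, 0.4                                   # assumed constants of the motion
times = [0.0, 0.5, 1.7, 4.2, 9.0]                     # arbitrary sample times
h_values = [hamiltonian(*phase_point(t, A, alpha)) for t in times]

start = phase_point(0.0, A, alpha)
after_one_period = phase_point(3.0 * math.pi, A, alpha)
print(h_values, start, after_one_period)
```

Each path is therefore the level curve $p^2 + q^2/9 = A^2$, an ellipse with semi-axes $3A$ and $A$, which is why the paths for different $A$ are concentric and similar.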
FIGURE 14.2 Liouville’s theorem: the Hamiltonian phase
flow preserves volume.


to $\mathcal{R}_0$, but Liouville’s theorem* states that the volumes† of the two regions are equal.
This remarkable result is expressed by saying that the Hamiltonian phase flow preserves
volume. The theorem is easy to apply, but the proof is rather difficult.

Proof of Liouville’s theorem

The proof is easier to follow if we use $x_1, x_2, \ldots, x_{2n}$ as the names of the variables (instead of $q$, $p$), and
also call the right sides of Hamilton’s equations $F_1, F_2, \ldots, F_{2n}$. Then, in vector notation, the equations of
motion are $\dot{x} = F(x, t)$. We will give the details for the case when the phase space is two-dimensional; the
method in the general case is the same but uglier.

Consider a set of phase points moving in the $(x_1, x_2)$-plane, which, at some instant in time, occupies
the region $\mathcal{R}_0$, as shown in Figure 14.2. Without losing generality, we may suppose that this occurs at time
$t = 0$. After time $t$, a typical point $x$ of $\mathcal{R}_0$ has moved on to position $X = X(x, t)$ and the set as a whole
now occupies the region $\mathcal{R}_t$. In this two-dimensional case, the ‘volume’ $v(t)$ of $\mathcal{R}_t$ is the area of this region
in the $(x_1, x_2)$-plane. Now

\[ v(t) = \int_{\mathcal{R}_t} dX_1\, dX_2 = \int_{\mathcal{R}_0} J \, dx_1\, dx_2, \]

where $J$ is the Jacobian of the transformation $X = X(x, t)$, that is,

\[ J = \begin{vmatrix} \partial X_1/\partial x_1 & \partial X_1/\partial x_2 \\ \partial X_2/\partial x_1 & \partial X_2/\partial x_2 \end{vmatrix}. \]

Now, for small $t$, $X$ may be approximated by

\[ X(x, t) = X(x, 0) + t\, \frac{\partial X}{\partial t}(x, 0) + O(t^2) = x + t\, F(x, 0) + O(t^2), \tag{14.29} \]

on using the equation of motion $\dot{x} = F(x, t)$. The corresponding approximation for $J$ is

\[ J = 1 + t \left[ \frac{\partial F_1}{\partial x_1} + \frac{\partial F_2}{\partial x_2} \right]_{t=0} + O(t^2) = 1 + t \operatorname{div} F(x, 0) + O(t^2). \]


* After the French mathematician Joseph Liouville (1809-1882).

† Since the dimension of the phase space can be any (even) number, this is a generalisation of the notion of
volume.




Chapter 14 Hamilton’s equations and phase space 



FIGURE 14.3 An instance of Liouville’s theorem with the
Hamiltonian $H = p^2 + q^2/9$. The shaded region moves
through the phase space. Its shape changes but its area
remains the same.


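Hence the volume of $\mathcal{R}_t$ is approximated by

\[ v(t) = \int_{\mathcal{R}_0} \bigl( 1 + t \operatorname{div} F(x, 0) \bigr)\, dx_1\, dx_2 + O(t^2) \]

when $t$ is small. It follows that

\[ \left. \frac{dv}{dt} \right|_{t=0} = \lim_{t \to 0} \left( \frac{v(t) - v(0)}{t} \right) = \int_{\mathcal{R}_0} \operatorname{div} F(x, 0)\, dx_1\, dx_2. \]

Finally, since the initial instant $t = 0$ was arbitrarily chosen, this result must apply for any $t$, that is,

\[ \frac{dv}{dt} = \int_{\mathcal{R}_t} \operatorname{div} F(x, t)\, dx_1\, dx_2 \]

at any time $t$. We see that, for general systems of equations, the phase flow does not preserve volume.
However, if $\operatorname{div} F(x, t) = 0$, then volume is preserved. For the case of Hamilton’s equations with one
degree of freedom,

\[ \operatorname{div} F = \frac{\partial F_1}{\partial x_1} + \frac{\partial F_2}{\partial x_2} = \frac{\partial}{\partial q}\left( \frac{\partial H}{\partial p} \right) + \frac{\partial}{\partial p}\left( -\frac{\partial H}{\partial q} \right) = 0. \]

Hence the Hamiltonian phase flow satisfies the condition $\operatorname{div} F = 0$ and so preserves volume. This
completes the proof. ■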


Liouville’s theorem

The motions of a Hamiltonian system preserve volume in $(q, p)$-space.


A particular instance of Liouville’s theorem is shown in Figure 14.3. The phase paths
of the Hamiltonian $H = p^2 + q^2/9$, shown in Figure 14.1, are concentric similar ellipses.
Figure 14.3 shows the progress of a region of the phase space lying between two such
elliptical paths. The region changes shape but its area remains the same.
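
This instance of the theorem is simple enough to test numerically. The sketch below is an illustration under assumptions: the triangular region, the sample times and the function names are not from the text, and the flow map is obtained by rewriting the general solution of Example 14.4 in terms of the initial values $(q_0, p_0)$. Because this flow is linear in $(q_0, p_0)$, it carries a triangle into a triangle, so the area of the moved region can be computed exactly from its three corners with the shoelace formula.

```python
import math

OMEGA = 2.0 / 3.0    # angular frequency of the motion for H = p^2 + q^2/9

def flow(q0, p0, t):
    """Position at time t of the phase point that starts at (q0, p0)."""
    c, s = math.cos(OMEGA * t), math.sin(OMEGA * t)
    return (q0 * c + 3.0 * p0 * s, p0 * c - (q0 / 3.0) * s)

def polygon_area(points):
    """Area of a simple polygon in the (q, p)-plane by the shoelace formula."""
    total = 0.0
    n = len(points)
    for i in range(n):
        q1, p1 = points[i]
        q2, p2 = points[(i + 1) % n]
        total += q1 * p2 - q2 * p1
    return abs(total) / 2.0

triangle = [(1.0, 0.0), (2.0, 0.0), (1.0, 1.0)]     # assumed initial region, area 0.5
areas = [
    polygon_area([flow(q, p, t) for q, p in triangle])
    for t in (0.0, 0.7, 2.0, 5.0)
]
print(areas)
```

The corners move and the triangle is sheared and rotated, but the computed area stays at its initial value, as Liouville’s theorem requires.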

Liouville’s theorem has many applications and is particularly important in statistical 
mechanics. The following is a simple example. 
FIGURE 14.4 Consequences of Poincare’s recurrence theorem. Left: the particle sliding
inside the smooth irregular bowl will eventually almost reassume its initial state. Right: The
mole of gas molecules, initially all in the left compartment will eventually all be found there
again.


Since the recurrence theorem holds for any sub-region of $\mathcal{R}_0$, it follows that,
throughout $\mathcal{R}_0$, there are phase points that pass arbitrarily close to their original positions. Thus
if the system S has one of these points as its initial state, then S will eventually become
arbitrarily close to reassuming that state. Actually, such points are typical rather than
exceptional. To show this requires a stronger version of Poincare’s theorem* than we have
proved here, namely: The path of almost every† point in $\mathcal{R}_0$ passes arbitrarily close to its
starting point. This implies the remarkable result that for almost every choice of the initial
conditions, the system S becomes arbitrarily close to reassuming those conditions at later
times.

An example of this phenomenon is the motion of a single particle $P$ sliding under
gravity on the smooth inner surface of a bowl of some irregular shape $z = F(x, y)$, as
shown in Figure 14.4. This is an autonomous Hamiltonian system with two degrees of
freedom. Take the Cartesian coordinates $(x, y)$ of $P$ to be generalised coordinates. Then
the Lagrangian is given by

\[ L = \tfrac{1}{2} m \left( \dot{x}^2 + \dot{y}^2 + \dot{z}^2 \right) - mgz \]
\[ \phantom{L} = \tfrac{1}{2} m \left( \dot{x}^2 + \dot{y}^2 + (F_{,x}\dot{x} + F_{,y}\dot{y})^2 \right) - mgF, \]

where $F_{,x} = \partial F/\partial x$ and $F_{,y} = \partial F/\partial y$. The conjugate momenta are

\[ p_x = m \left( \dot{x} + (F_{,x}\dot{x} + F_{,y}\dot{y}) F_{,x} \right), \qquad p_y = m \left( \dot{y} + (F_{,x}\dot{x} + F_{,y}\dot{y}) F_{,y} \right). \]

Just because energy is conserved, it does not necessarily mean that Poincare’s theorem 
applies. We must show that the energy surfaces in phase space are bounded. The proof of 
this is as follows: 


* See Walters [12].

† This means that the set of exceptional points has measure zero. For example, in a two-dimensional phase
space, a curve has zero measure.
In this case, $\mathcal{R}_0$ is some bounded region of the phase space $(x, y, p_x, p_y)$. Suppose that $z_0$ is the maximum
value of $z$ and that $T_0$ is the maximum kinetic energy associated with points of $\mathcal{R}_0$. Then, by energy
conservation, the maximum value of $z$ in the subsequent motions cannot exceed $z_{\max} = z_0 + (T_0/mg)$.
Hence, providing the bowl rises to at least this height, the motions are confined to values of $(x, y)$ that satisfy
$F(x, y) \le z_{\max}$. It follows that both $x$ and $y$ are bounded in the subsequent motions. Also, if the lowest
point of the bowl is at $z = 0$, the value of $T$ in the subsequent motions cannot exceed $T_{\max} = T_0 + mgz_0$.
It follows that $\dot{x}$ and $\dot{y}$ are bounded in the subsequent motions and this implies that the same is true for $p_x$
and $p_y$. Hence $x$, $y$, $p_x$, and $p_y$ are all bounded in the subsequent motions. This means that the paths
of the phase points that lie in $\mathcal{R}_0$ when $t = 0$ are confined to a bounded region of the phase space for all
time. Poincare’s theorem therefore applies.

Hence, if the particle $P$ is released from rest (say) at some point $A$ on the surface of
the bowl, then, whatever the shape of the bowl, $P$ will become arbitrarily close to being
at rest at $A$ at later times.

In the same way, it follows that if a compartment containing a mole of gas molecules
is separated from an empty compartment by a partition, and the partition is suddenly
punctured, then at (infinitely many!) later times the molecules will all be found in the
first compartment again. This remarkable result, which seems to be in contradiction to
the second law of thermodynamics, appears less paradoxical when one realises that ‘later
times’ may mean $10^{20}$ years later!

Question Exceptional points 

How do you know that the initial conditions you have chosen do not correspond to an 

‘exceptional point’ for which Poincare’s theorem does not hold? 

Answer 

You don’t know, but you would be very unlucky if this happened! ■ 


Problems on Chapter 14 

Answers and comments are at the end of the book. 

Harder problems carry a star (*). 

Finding Hamiltonians 

14.1 Find the Legendre transform $G(v_1, v_2, w)$ of the function

\[ F(u_1, u_2, w) = 2u_1^2 - 3u_1u_2 + u_2^2 + 3wu_1, \]

where $w$ is a passive variable. Verify that $\partial F/\partial w = -\partial G/\partial w$.

14.2 A smooth wire has the form of the helix $x = a\cos\theta$, $y = a\sin\theta$, $z = b\theta$, where
$\theta$ is a real parameter, and $a$, $b$ are positive constants. The wire is fixed with the axis $Oz$
pointing vertically upwards. A particle $P$ of mass $m$ can slide freely on the wire. Taking $\theta$ as
generalised coordinate, find the Hamiltonian and obtain Hamilton’s equations for this system.
14.3 Projectile Using Cartesian coordinates, find the Hamiltonian for a projectile of mass $m$
moving under uniform gravity. Obtain Hamilton’s equations and identify any cyclic
coordinates.

14.4 Spherical pendulum The spherical pendulum is a particle of mass $m$ attached to a fixed
point by a light inextensible string of length $a$ and moving under uniform gravity. It differs
from the simple pendulum in that the motion is not restricted to lie in a vertical plane. Show
that the Lagrangian is

\[ L = \tfrac{1}{2} m a^2 \left( \dot{\theta}^2 + \sin^2\theta\, \dot{\phi}^2 \right) + mga\cos\theta, \]

where the polar angles $\theta$, $\phi$ are shown in Figure 11.7. Find the Hamiltonian and obtain
Hamilton’s equations. Identify any cyclic coordinates.

14.5 The system shown in Figure 10.9 consists of two particles $P_1$ and $P_2$ connected by a light
inextensible string of length $a$. The particle $P_1$ is constrained to move along a fixed smooth
horizontal rail, and the whole system moves under uniform gravity in the vertical plane through
the rail. For the case in which the particles are of equal mass $m$, show that the Lagrangian is

\[ L = \tfrac{1}{2} m \left( 2\dot{x}^2 + 2a\dot{x}\dot{\theta}\cos\theta + a^2\dot{\theta}^2 \right) + mga\cos\theta, \]

where $x$ and $\theta$ are the coordinates shown in Figure 10.9.

Find the Hamiltonian and verify that it satisfies the equations $\dot{x} = \partial H/\partial p_x$ and $\dot{\theta} = \partial H/\partial p_\theta$.
[Messy algebra.]

14.6 Pendulum with a shortening string A particle is suspended from a support by a light
inextensible string which passes through a small fixed ring vertically below the support. The
particle moves in a vertical plane with the string taut. At the same time, the support is made to
move vertically having an upward displacement $Z(t)$ at time $t$. The effect is that the particle
oscillates like a simple pendulum whose string length at time $t$ is $a - Z(t)$, where $a$ is a positive
constant. Show that the Lagrangian is

\[ L = \tfrac{1}{2} m \left( (a - Z)^2 \dot{\theta}^2 + \dot{Z}^2 \right) + mg(a - Z)\cos\theta, \]

where $\theta$ is the angle between the string and the downward vertical.

Find the Hamiltonian and obtain Hamilton’s equations. Is $H$ conserved?

14.7 Charged particle in an electrodynamic field The Lagrangian for a particle with mass $m$
and charge $e$ moving in the general electrodynamic field $\{E(r, t), B(r, t)\}$ is given in Cartesian
coordinates by

\[ L(r, \dot{r}, t) = \tfrac{1}{2} m\, \dot{r} \cdot \dot{r} - e\,\phi(r, t) + e\, \dot{r} \cdot A(r, t), \]

where $r = (x, y, z)$ and $\{\phi, A\}$ are the electrodynamic potentials of the field $\{E, B\}$. Show that
the corresponding Hamiltonian is given by

\[ H(r, p, t) = \frac{(p - eA) \cdot (p - eA)}{2m} + e\,\phi, \]
Hamilton’s equations), their density function should remain the same, that is, $\rho = \rho(q, p)$.
This places a restriction on possible choices for $\rho(q, p)$. Let $\mathcal{R}_0$ be any region of the phase
space and suppose that, after time $t$, the points of $\mathcal{R}_0$ occupy the region $\mathcal{R}_t$. Explain why
statistical equilibrium requires that

\[ \int_{\mathcal{R}_0} \rho(q, p)\, dv = \int_{\mathcal{R}_t} \rho(q, p)\, dv \]

and show that the uniform density function $\rho(q, p) = \rho_0$ satisfies this condition. [It can be
proved that the above condition is also satisfied by any density function that is constant along
the streamlines of the phase flow.]

14.13 Decide if the energy surfaces in phase space are bounded in the following cases:

(i) The two-body gravitation problem with $E < 0$.

(ii) The two-body gravitation problem viewed from the zero momentum frame and with
$E < 0$.

(iii) The three-body gravitation problem viewed from the zero momentum frame and with
$E < 0$. Does the solar system have the recurrence property?

Poisson brackets 

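14.14 Poisson brackets Suppose that $u(q, p)$ and $v(q, p)$ are any two functions of position
in the phase space $(q, p)$ of a mechanical system S. Then the Poisson bracket $[u, v]$ of $u$ and
$v$ is defined by

\[ [u, v] = \operatorname{grad}_q u \cdot \operatorname{grad}_p v - \operatorname{grad}_p u \cdot \operatorname{grad}_q v = \sum_{j=1}^{n} \left( \frac{\partial u}{\partial q_j}\frac{\partial v}{\partial p_j} - \frac{\partial u}{\partial p_j}\frac{\partial v}{\partial q_j} \right). \]
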
The algebraic behaviour of the Poisson bracket of two functions resembles that of the
cross product $U \times V$ of two vectors or the commutator $UV - VU$ of two matrices. The
Poisson bracket of two functions is closely related to the commutator of the corresponding
operators in quantum mechanics.*

Prove the following properties of Poisson brackets.

Algebraic properties

\[ [u, u] = 0, \qquad [v, u] = -[u, v], \qquad [\lambda_1 u_1 + \lambda_2 u_2, v] = \lambda_1 [u_1, v] + \lambda_2 [u_2, v], \]

\[ [[u, v], w] + [[w, u], v] + [[v, w], u] = 0. \]

This last formula is called Jacobi’s identity. It is quite important, but there seems to be no way of proving
it apart from crashing it out, which is very tedious. Unless you can invent a smart method, leave this one
alone.


* The commutator $[U, V]$ of two quantum mechanical operators $U$, $V$ corresponds to $i\hbar[u, v]$, where $\hbar$ is
the modified Planck constant, and $[u, v]$ is the Poisson bracket of the corresponding classical variables
$u$, $v$.
FIGURE 14.5 There is no such thing as a typical orbit in the three-body
problem. The orbit shown corresponds to the initial conditions $x = 0$, $y = 0$,
$p_x = 1.03$, $p_y = 0$ and is viewed from axes rotating with the primaries.

Suppose the primaries each have mass M and move under their mutual gravitation around 
a fixed circle of radius a, being at the opposite ends of a rotating diameter. The body P moves 
under the gravitational attraction of the primaries in the same plane as their circular orbit. 
Using Cartesian coordinates, write code to set up Hamilton’s equations for this system and 
solve them with general initial conditions. [Take M as the unit of mass, a as the unit of length, 
and take the unit of time so that the speed of the primaries is unity. With this choice of units, 
the gravitational constant G = 4.]

By experimenting with different initial conditions, some very weird orbits can be found 
for P. It is interesting to plot these relative to fixed axes and also relative to axes rotating with 
the primaries, as in Figure 14.5. Some fascinating cases are shown by Acheson || and you 
should be able to reproduce these. [Acheson used a different normalisation, however, and his
initial data needs to be doubled to be used in your code.]
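One possible starting point for this computation is sketched below. It assumes that P has unit mass (its mass cancels from the equations of motion), that the primaries lie on the x-axis at $t = 0$ and rotate anticlockwise with unit angular speed, and that a fixed-step fourth-order Runge-Kutta scheme is accurate enough. All function names are illustrative. With these assumptions the Hamiltonian is $H = \tfrac{1}{2}(p_x^2 + p_y^2) - G/r_1 - G/r_2$ with $G = 4$, and the combination $H - (x p_y - y p_x)$ should stay constant along an orbit, which gives a useful accuracy check.

```python
import math

G = 4.0  # gravitational constant in the units of the problem


def primaries(t):
    """Positions of the two primaries: opposite ends of a diameter of the unit circle."""
    c, s = math.cos(t), math.sin(t)
    return (c, s), (-c, -s)


def rhs(t, state):
    """Hamilton's equations for P (unit mass assumed); state = (x, y, px, py)."""
    x, y, px, py = state
    fx = fy = 0.0
    for (X, Y) in primaries(t):
        dx, dy = X - x, Y - y
        r3 = (dx * dx + dy * dy) ** 1.5
        fx += G * dx / r3
        fy += G * dy / r3
    return (px, py, fx, fy)


def rk4_step(t, state, dt):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(k, f):
        return tuple(s + f * ki for s, ki in zip(state, k))
    k1 = rhs(t, state)
    k2 = rhs(t + dt / 2, shifted(k1, dt / 2))
    k3 = rhs(t + dt / 2, shifted(k2, dt / 2))
    k4 = rhs(t + dt, shifted(k3, dt))
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def solve(state, t_end, dt=1e-3):
    """Integrate from t = 0 to t_end; returns a list of (t, state) pairs."""
    t, path = 0.0, [(0.0, state)]
    for i in range(int(round(t_end / dt))):
        state = rk4_step(t, state, dt)
        t = (i + 1) * dt
        path.append((t, state))
    return path


def jacobi_constant(t, state):
    """H - L_z, which is conserved because the potential rotates with unit angular speed."""
    x, y, px, py = state
    H = 0.5 * (px * px + py * py)
    for (X, Y) in primaries(t):
        H -= G / math.hypot(X - x, Y - y)
    return H - (x * py - y * px)


def to_rotating(t, state):
    """Coordinates of P relative to axes rotating with the primaries."""
    x, y = state[0], state[1]
    c, s = math.cos(t), math.sin(t)
    return (c * x + s * y, -s * x + c * y)
```

A call such as `solve((0.0, 0.0, 1.03, 0.0), 20.0)` followed by `to_rotating` on each point gives data for a plot relative to the rotating axes. A fixed step is a simplification; close approaches to a primary would call for a smaller or adaptive step.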
FIGURE 15.1 The circle $C_\delta$ must be chosen small enough so that all the phase paths that
start within it remain within the given circle $C_\Delta$.

the system from a configuration close to the equilibrium configuration with a small kinetic
energy. Let $S_\Delta$ be the smallest sphere in phase space with centre $(q^{(0)}, 0)$ that contains
all the phase paths that begin within $S_\delta$.

Definition 15.1 Stable equilibrium If the radius $\Delta$ tends to zero as the radius $\delta$ tends
to zero, then the equilibrium point at $(q^{(0)}, 0)$ is said to be stable.*

This means that, if S is given a small nudge from a configuration close to a position of 
stable equilibrium, then the subsequent motion of S (in configuration space) is restricted 
to a small neighbourhood of the equilibrium point. Thus any mechanical system can 
perform small motions near a position of stable equilibrium. These motions are generally 
called small oscillations. 

We know that the equilibrium positions of S correspond to the stationary points of 
the potential energy V ( q ), but we have yet to identify which of these points correspond to 
stable equilibrium. In fact it is quite easy to prove the following important result: 


Minimum points of V 

The minimum points of the potential energy function V ( q ) are positions of stable 
equilibrium of the system S. 


Proof. Without loss of generality, suppose that the minimum point of the function $V(q)$ is at $q = 0$ and
that $V(0) = 0$. Take any $\Delta > 0$. Then we must show that we can find a sphere $S_\delta$ in phase space such that
all the paths that begin within it remain inside the sphere $S_\Delta$. This is illustrated in Figure 15.1 for the only
case that can be drawn, namely, when the phase space is two-dimensional; in this case, the ‘spheres’ are
circles.

The result follows from energy conservation. Let $T(q, p)$ be the kinetic energy of the system. The
total energy is then

$$E(q, p) = T(q, p) + V(q).$$


* In the dynamical systems literature, this is known as Liapunov stability. 
FIGURE 15.2 The double pendulum. Left: The generalised coordinates $\theta$, $\phi$. Right: The
velocity diagram.


Since $q = 0$ is a minimum point of $V(q)$ and $T(q, p)$ is positive for $p \neq 0$, it follows that $(0, 0)$, the origin
of phase space, is a minimum point of the function $E(q, p)$; the value of $E$ at the minimum point is zero.
The value of $E$ on the sphere $S_\Delta$ must therefore be greater than some positive number $E_\Delta$. On the other
hand, by making the radius $\delta$ small enough, it follows by continuity that the value of $E$ within the sphere
$S_\delta$ can be made as close to $E(0, 0)$ (= 0) as we wish; we can certainly make it less than $E_\Delta$. Consider now
any phase path starting within the circle $C_\delta$. Then, by energy conservation, $E$ is constant along this path
and is less than $E_\Delta$. Such a path cannot cross the sphere $S_\Delta$, for, if it did, the value of $E$ at the crossing
point would be greater than $E_\Delta$, which is not true. Hence, any phase path starting within the sphere $S_\delta$
must remain within the sphere $S_\Delta$ and this completes the proof. ■


Example 15.1 Stability of equilibrium 

Consider the double pendulum shown in Figure 15.2 which moves under uniform
gravity. Show that the vertically downwards configuration is a position of stable
equilibrium.

Solution

The system is assumed to move in a vertical plane through the suspension point O.
In terms of the generalised coordinates $\theta$, $\phi$ shown, the vertically downwards
configuration corresponds to the point $\theta = \phi = 0$ in configuration space. The gravitational
potential energy is given by

$$V = (M + m)gb(1 - \cos\theta) + mgc(1 - \cos\phi)$$

so that $\partial V/\partial\theta = (M + m)gb\sin\theta = 0$ when $\theta = \phi = 0$. The same is true for $\partial V/\partial\phi$
and so the point $\theta = \phi = 0$ is a stationary point of the function $V(\theta, \phi)$. It follows
that the downwards configuration is a position of equilibrium.

We could determine the nature of this stationary point by looking at the second
derivatives of $V(\theta, \phi)$, but there is no need because it is evident that $V(\theta, \phi) >
V(0, 0)$ unless $\theta = \phi = 0$. Thus $(0, 0)$ is a minimum point of $V$ and so the downwards
configuration is a position of stable equilibrium. ■
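For readers who prefer to see the second-derivative test carried out, here is a short check. It is a sketch that uses only the potential energy $V = (M + m)gb(1 - \cos\theta) + mgc(1 - \cos\phi)$ of this example:

$$\left.\frac{\partial^2 V}{\partial\theta^2}\right|_{(0,0)} = (M + m)gb > 0, \qquad \left.\frac{\partial^2 V}{\partial\phi^2}\right|_{(0,0)} = mgc > 0, \qquad \frac{\partial^2 V}{\partial\theta\,\partial\phi} = 0.$$

The matrix of second derivatives at $(0, 0)$ is therefore diagonal with positive entries, which confirms that $(0, 0)$ is a minimum point of $V$.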
where the neglected terms have power four (or higher) in small quantities. If these
approximate forms for T and V are now substituted into Lagrange’s equations, the linearised
equations of motion (15.6), (15.7) are obtained immediately. [Check this.] This is clearly
superior to our original method.

The general approximate form of V 

In the general case, suppose that the potential energy $V(q)$ of the system S has a minimum
at $q = 0$ and that $V(0) = 0$. (If the minimum point of $V$ is not at $q = 0$, it can always be
made so by a simple change of coordinates.) Then, for $q$ near $0$, $V(q)$ can be expanded as
an ($n$-dimensional) Taylor series in the variables $q_1, q_2, \ldots, q_n$. For the special case when
S has two degrees of freedom, this series has the form

$$V(q_1, q_2) = V(0, 0) + \left( \frac{\partial V}{\partial q_1} q_1 + \frac{\partial V}{\partial q_2} q_2 \right) + \frac{1}{2}\left( \frac{\partial^2 V}{\partial q_1^2} q_1^2 + 2\frac{\partial^2 V}{\partial q_1 \partial q_2} q_1 q_2 + \frac{\partial^2 V}{\partial q_2^2} q_2^2 \right) + \cdots,$$


where all partial derivatives of $V$ are evaluated at the expansion point $q_1 = q_2 = 0$.
Now $V$ has been selected so that $V(0, 0) = 0$. Also, since $(0, 0)$ is a stationary point of
$V(q_1, q_2)$, it follows that $\partial V/\partial q_1 = \partial V/\partial q_2 = 0$ there. Thus the constant and linear
terms are absent from the Taylor expansion of $V$. It follows that $V$ can be approximated
by

$$V^{\mathrm{app}}(q_1, q_2) = v_{11} q_1^2 + 2 v_{12} q_1 q_2 + v_{22} q_2^2,$$

where $v_{11}$, $v_{12}$, $v_{22}$ are constants given by

$$v_{11} = \frac{1}{2}\frac{\partial^2 V}{\partial q_1^2}(0, 0), \qquad v_{12} = \frac{1}{2}\frac{\partial^2 V}{\partial q_1 \partial q_2}(0, 0), \qquad v_{22} = \frac{1}{2}\frac{\partial^2 V}{\partial q_2^2}(0, 0),$$

and the neglected terms have power three (or higher) in the small quantities $q_1$, $q_2$. The
corresponding approximation to $V(q)$ in the case when S has $n$ degrees of freedom is

$$V^{\mathrm{app}}(q) = \sum_{j=1}^{n} \sum_{k=1}^{n} v_{jk}\, q_j q_k \qquad (15.8)$$

where the $\{v_{jk}\}$ are constants given by

$$v_{jk} = v_{kj} = \frac{1}{2} \left. \frac{\partial^2 V}{\partial q_j \partial q_k} \right|_{q = 0}$$
and the neglected terms have power three (or higher) in the small quantities $q_1, q_2, \ldots,
q_n$. This is the general form of the approximate potential energy $V^{\mathrm{app}}(q)$. It is a
homogeneous quadratic form in the variables $q_1, q_2, \ldots, q_n$.
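When the second derivatives of $V$ are awkward to find by hand, the coefficients of the quadratic form can be estimated numerically. The sketch below assumes the convention $v_{jk} = \tfrac{1}{2}\,\partial^2 V/\partial q_j \partial q_k$ at $q = 0$ (so that $V^{\mathrm{app}} = \sum\sum v_{jk} q_j q_k$) and uses central differences. The function names, the step size and the numerical values chosen for the double pendulum are illustrative assumptions.

```python
import math


def v_matrix(V, n, h=1e-3):
    """Estimate v_jk = (1/2) d^2V/dq_j dq_k at q = 0 by central differences."""
    def at(*shifts):
        q = [0.0] * n
        for index, amount in shifts:
            q[index] += amount
        return V(q)

    v = [[0.0] * n for _ in range(n)]
    for j in range(n):
        for k in range(n):
            second = (at((j, h), (k, h)) - at((j, h), (k, -h))
                      - at((j, -h), (k, h)) + at((j, -h), (k, -h))) / (4 * h * h)
            v[j][k] = 0.5 * second
    return v


def double_pendulum_V(q):
    """Exact V of the double pendulum with (M + m) g b = 4 and m g c = 1 (sample values)."""
    theta, phi = q
    return 4.0 * (1 - math.cos(theta)) + 1.0 * (1 - math.cos(phi))


v = v_matrix(double_pendulum_V, 2)   # close to [[2, 0], [0, 0.5]]
```

The estimate agrees with the hand calculation only to the accuracy of the difference scheme, so it is best used as a check rather than as a replacement for the Taylor expansion.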

In the theory that follows, we will always assume that $q = 0$ is also a minimum point
of the approximate potential energy $V^{\mathrm{app}}(q)$.* This condition is equivalent to requiring
that the quadratic form (15.8) should be positive definite. This simply means that it takes
positive values except when $q = 0$.
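Positive definiteness of a small symmetric matrix of coefficients can be tested mechanically. The sketch below uses Sylvester's criterion (every leading principal minor must be positive), which is a standard test that the text itself does not mention. The function names are illustrative.

```python
def determinant(a):
    """Determinant by Laplace expansion along the first row (adequate for small n)."""
    n = len(a)
    if n == 1:
        return a[0][0]
    total = 0.0
    for col in range(n):
        minor = [row[:col] + row[col + 1:] for row in a[1:]]
        total += (-1) ** col * a[0][col] * determinant(minor)
    return total


def is_positive_definite(a):
    """Sylvester's criterion: every leading principal minor must be positive."""
    return all(determinant([row[:k] for row in a[:k]]) > 0
               for k in range(1, len(a) + 1))


# Coefficients of 2 q1^2 + 0.5 q2^2: positive definite.
print(is_positive_definite([[2.0, 0.0], [0.0, 0.5]]))
# Coefficients of q1^2 alone: the form vanishes along the q2-axis, so the test fails.
print(is_positive_definite([[1.0, 0.0], [0.0, 0.0]]))
```

The second case is the kind of situation excluded by the assumption above: the quadratic form is non-negative but not positive definite.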

The general approximate form of T 

For any standard mechanical system with generalised coordinates $q$, the kinetic energy $T$
has the form

$$T(q, \dot{q}) = \sum_{j=1}^{n} \sum_{k=1}^{n} t_{jk}(q)\, \dot{q}_j \dot{q}_k,$$

a quadratic form in the variables $\dot{q}_1, \dot{q}_2, \ldots, \dot{q}_n$ with coefficients that depend on $q$. If
we expand each of these coefficients as a Taylor series about $q = 0$, the constant term is
simply $t_{jk}(0)$ and

$$T = \sum_{j=1}^{n} \sum_{k=1}^{n} t_{jk}(0)\, \dot{q}_j \dot{q}_k + \cdots.$$

It follows that $T$ can be approximated by

$$T^{\mathrm{app}} = \sum_{j=1}^{n} \sum_{k=1}^{n} t_{jk}\, \dot{q}_j \dot{q}_k \qquad (15.9)$$

where the constants $\{t_{jk}\}$ are what we previously called $\{t_{jk}(0)\}$, and the neglected terms
have power three (or higher) in the small quantities $q_1, q_2, \ldots, q_n$, $\dot{q}_1, \dot{q}_2, \ldots, \dot{q}_n$. This
is the general form of the approximate kinetic energy $T^{\mathrm{app}}(\dot{q})$. It is a homogeneous
quadratic form in the variables $\dot{q}_1, \dot{q}_2, \ldots, \dot{q}_n$. Since $T(q, \dot{q}) > 0$ except when $\dot{q} = 0$, it
follows that the quadratic form (15.9) must also be positive definite.

Useful tip: Since $T^{\mathrm{app}}(\dot{q}) = T(0, \dot{q})$, it follows that $T^{\mathrm{app}}$ can be found directly by
calculating $T$ when the system is passing through the equilibrium position; the general
formula for $T$ need never be found!


* It might appear that this follows from the fact that $q = 0$ is known to be a minimum point of the exact $V$, but
this is not necessarily so. For example, if $V(q_1, q_2) = q_1^2 + q_2^4$, then $V^{\mathrm{app}} = q_1^2$, which does not have
a strict minimum at $q_1 = q_2 = 0$. The general theory of small oscillations does not apply to such cases,
and we exclude them.
The V-matrix and the T-matrix

In order to express the general theory concisely, we introduce the $n \times n$ matrices V and T
as follows:

Definition 15.2 The V-matrix and the T-matrix The symmetric $n \times n$ matrix V whose
elements are the coefficients $\{v_{jk}\}$ that appear in the formula (15.8) is called the V-matrix.
The symmetric $n \times n$ matrix T whose elements are the coefficients $\{t_{jk}\}$ that appear in the
formula (15.9) is called the T-matrix.

In terms of V and T, the approximate potential and kinetic energies of S can be written
in compact matrix notation:*


Quadratic forms for $V^{\mathrm{app}}$ and $T^{\mathrm{app}}$

$$V^{\mathrm{app}} = \sum_{j=1}^{n} \sum_{k=1}^{n} v_{jk}\, q_j q_k = \mathbf{q}' \cdot \mathbf{V} \cdot \mathbf{q}$$

$$T^{\mathrm{app}} = \sum_{j=1}^{n} \sum_{k=1}^{n} t_{jk}\, \dot{q}_j \dot{q}_k = \dot{\mathbf{q}}' \cdot \mathbf{T} \cdot \dot{\mathbf{q}} \qquad (15.10)$$

where $\mathbf{q}$ is the column vector with elements $\{q_j\}$, and $\dot{\mathbf{q}}$ is the column vector with elements
$\{\dot{q}_j\}$.

Example 15.2 Finding V and T for the double pendulum 


Find the matrices V and T for the double pendulum. 


Solution 

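For the double pendulum, $V^{\mathrm{app}}$ is given by

$$V^{\mathrm{app}} = \tfrac{1}{2}(M + m)gb\,\theta^2 + \tfrac{1}{2}mgc\,\phi^2,$$

and $T^{\mathrm{app}}$ is given by

$$T^{\mathrm{app}} = \tfrac{1}{2}(M + m)b^2\dot{\theta}^2 + mbc\,\dot{\theta}\dot{\phi} + \tfrac{1}{2}mc^2\dot{\phi}^2.$$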



* The notation $\mathbf{x}'$ means the transpose of the column vector $\mathbf{x}$. The alternative notation $\mathbf{x}^T$ would cause
confusion here.
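Hence, V and T are the $2 \times 2$ matrices

$$\mathbf{V} = \begin{pmatrix} \tfrac{1}{2}(M + m)gb & 0 \\ 0 & \tfrac{1}{2}mgc \end{pmatrix}, \qquad \mathbf{T} = \begin{pmatrix} \tfrac{1}{2}(M + m)b^2 & \tfrac{1}{2}mbc \\ \tfrac{1}{2}mbc & \tfrac{1}{2}mc^2 \end{pmatrix}. \qquad ■$$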
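As a cross-check on this example, the two matrices can be coded directly and the quadratic form $\dot{\mathbf{q}}' \cdot \mathbf{T} \cdot \dot{\mathbf{q}}$ compared with the expression for $T^{\mathrm{app}}$. This is a minimal sketch. The function names and the numerical parameter values are illustrative assumptions.

```python
def double_pendulum_matrices(M, m, b, c, g):
    """Return (V, T) as nested lists, read off from V_app and T_app."""
    V = [[0.5 * (M + m) * g * b, 0.0],
         [0.0, 0.5 * m * g * c]]
    T = [[0.5 * (M + m) * b * b, 0.5 * m * b * c],
         [0.5 * m * b * c, 0.5 * m * c * c]]
    return V, T


def quadratic_form(A, x):
    """Evaluate x' . A . x for a square matrix A and a vector x."""
    n = len(x)
    return sum(x[j] * A[j][k] * x[k] for j in range(n) for k in range(n))


V, T = double_pendulum_matrices(M=3.0, m=1.0, b=1.0, c=1.0, g=1.0)
# With theta_dot = phi_dot = 1: T_app = (1/2)(4)(1) + (1)(1)(1)(1)(1) + (1/2)(1)(1) = 3.5
kinetic = quadratic_form(T, [1.0, 1.0])
```

Note that the off-diagonal entry of T is half the coefficient of the cross term, because the cross term is counted twice in the double sum.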

15.3 THE GENERAL THEORY OF NORMAL MODES 

In this section, we develop the general theory of normal modes for any oscillating
system. This extends the method described in Chapter 5, which was restricted to two
degrees of freedom.

The small oscillation equations

The first step is to obtain the general form of the small oscillation equations. This is
done by substituting the approximate potential and kinetic energies $V^{\mathrm{app}}$ and $T^{\mathrm{app}}$ into
Lagrange’s equations. Now

$$\frac{d}{dt}\left( \frac{\partial T^{\mathrm{app}}}{\partial \dot{q}_j} \right) = 2\sum_{k=1}^{n} t_{jk}\, \ddot{q}_k, \qquad \frac{\partial T^{\mathrm{app}}}{\partial q_j} = 0, \qquad \frac{\partial V^{\mathrm{app}}}{\partial q_j} = 2\sum_{k=1}^{n} v_{jk}\, q_k,$$

so that Lagrange’s equations become


Small oscillation equations

Expanded form:

$$\sum_{k=1}^{n} \left( t_{jk}\, \ddot{q}_k + v_{jk}\, q_k \right) = 0 \qquad (1 \le j \le n)$$

Matrix form:

$$\mathbf{T} \cdot \ddot{\mathbf{q}} + \mathbf{V} \cdot \mathbf{q} = \mathbf{0} \qquad (15.11)$$


in the expanded and matrix forms respectively. These are the linearised equations for the
small oscillations of S about the point $q = 0$. They are a set of $n$ coupled second order
linear ODEs satisfied by the unknown functions $q_1(t), q_2(t), \ldots, q_n(t)$.

Normal modes 

The next step is to find a special class of solutions of the small oscillation equations known
as normal modes. We will show later that the general solution of the small oscillation
equations can be expressed as a sum of normal modes.

Definition 15.3 Normal mode A solution of the small oscillation equations that has
the special form

Expanded form:

$$q_j = a_j \cos(\omega t - \gamma) \qquad (1 \le j \le n)$$

Matrix form:

$$\mathbf{q} = \mathbf{a} \cos(\omega t - \gamma) \qquad (15.12)$$

where the $\{a_j\}$, $\omega$ and $\gamma$ are constants, is called a normal mode of the system S.

Notes. In a normal mode, the coordinates $q_1, q_2, \ldots, q_n$ all vary harmonically in time
with the same frequency $\omega$ and the same phase $\gamma$; however, they generally have different
amplitudes $a_1, a_2, \ldots, a_n$. The $n$-dimensional quantity $a = (a_1, a_2, \ldots, a_n)$ is called the
amplitude vector of the mode and, when considered to be a column vector, will be written
$\mathbf{a}$. Without losing generality, the angular frequency $\omega$ can be assumed to be positive.

On substituting the normal mode form (15.12) into the small oscillation equations
(15.11), we obtain, on cancelling by the common factor $\cos(\omega t - \gamma)$,


Equations for the amplitude vector

Expanded form:

$$\sum_{k=1}^{n} \left( v_{jk} - \omega^2 t_{jk} \right) a_k = 0 \qquad (1 \le j \le n)$$

Matrix form:

$$(\mathbf{V} - \omega^2 \mathbf{T}) \cdot \mathbf{a} = \mathbf{0} \qquad (15.13)$$


This is a system of $n$ simultaneous linear algebraic equations for the $n$ coordinate
amplitudes $\{a_k\}$. A normal mode will exist if we can find constants $\{a_k\}$, $\omega$ so that the
equations (15.13) are satisfied. Since the equations are homogeneous, they always have
the trivial solution $a_1 = a_2 = \cdots = a_n = 0$, whatever the value of $\omega$. However, the
trivial solution corresponds to the equilibrium solution $q_1 = q_2 = \cdots = q_n = 0$ of the
governing equations (15.11), which is not a motion at all. We therefore need the equations
(15.13) to have a non-trivial solution for the $\{a_k\}$. There is a simple condition that this
should be so, namely that the determinant of the system of equations should be zero, that
is,


Determinantal equation for $\omega$

$$\det(\mathbf{V} - \omega^2 \mathbf{T}) = 0 \qquad (15.14)$$


This is the equation satisfied by the angular frequency $\omega$ in any normal mode of the system
S. When expanded, this is a polynomial equation of degree $n$ in the variable $\omega^2$. If this
equation has any real positive roots $\omega_1^2, \omega_2^2, \ldots$, then, for each of these values of $\omega$, the
linear equations (15.13) will have a non-trivial solution for the amplitudes $\{a_k\}$ and a
normal mode will exist.

Definition 15.4 Normal frequencies The angular frequencies $\omega_1, \omega_2, \ldots$ of the
normal modes are called the normal frequencies of the system S.

The normal frequencies are a very important characteristic of an oscillating system.
They are found by solving the determinantal equation (15.14) for $\omega$. In the example that
follows, we find the normal frequencies of the double pendulum, and three further worked
examples are given in section 15.5.


Example 15.3 Normal frequencies of the double pendulum 

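Find the normal frequencies of the double pendulum for the case in which $M = 3m$
and $c = b$.


Solution

With these special values, the matrices V and T become

$$\mathbf{V} = \tfrac{1}{2}mgb \begin{pmatrix} 4 & 0 \\ 0 & 1 \end{pmatrix}, \qquad \mathbf{T} = \tfrac{1}{2}mb^2 \begin{pmatrix} 4 & 1 \\ 1 & 1 \end{pmatrix}.$$

The determinantal equation for $\omega$ is therefore

$$\det\left[ \tfrac{1}{2}mgb \begin{pmatrix} 4 & 0 \\ 0 & 1 \end{pmatrix} - \tfrac{1}{2}mb^2\omega^2 \begin{pmatrix} 4 & 1 \\ 1 & 1 \end{pmatrix} \right] = 0,$$

which can be simplified into the form

$$\begin{vmatrix} 4n^2 - 4\omega^2 & -\omega^2 \\ -\omega^2 & n^2 - \omega^2 \end{vmatrix} = 0,$$

where $n^2 = g/b$. On expanding this determinant, we obtain

$$3\omega^4 - 8n^2\omega^2 + 4n^4 = 0,$$

which is a quadratic equation in the variable $\omega^2$. This is the equation satisfied by the
normal frequencies. This quadratic factorises and has two real positive roots $\omega_1^2$, $\omega_2^2$
for $\omega^2$, where

$$\omega_1^2 = \frac{2n^2}{3}, \qquad \omega_2^2 = 2n^2,$$

where $n^2 = g/b$. The double pendulum therefore has the two normal frequencies
$(2g/3b)^{1/2}$ and $(2g/b)^{1/2}$. ■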
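The calculation in this example generalises to any system with two degrees of freedom, since $\det(\mathbf{V} - \lambda\mathbf{T}) = 0$ is then a quadratic in $\lambda = \omega^2$. The following sketch expands the determinant and applies the quadratic formula. The function name is illustrative, and the sample matrices are those of this example with $m = g = b = 1$ (so that $n^2 = 1$), an assumption made purely for the check.

```python
import math


def normal_frequencies_2dof(V, T):
    """Solve det(V - lam*T) = 0 for a system with two degrees of freedom.

    Returns the normal frequencies (square roots of the roots lam), smallest first.
    Assumes both roots are real and positive, as the theory guarantees when
    V and T are symmetric and positive definite.
    """
    # det(V - lam*T) = A*lam^2 + B*lam + C
    A = T[0][0] * T[1][1] - T[0][1] * T[1][0]
    B = -(V[0][0] * T[1][1] + V[1][1] * T[0][0]
          - V[0][1] * T[1][0] - V[1][0] * T[0][1])
    C = V[0][0] * V[1][1] - V[0][1] * V[1][0]
    disc = math.sqrt(B * B - 4 * A * C)
    roots = sorted([(-B - disc) / (2 * A), (-B + disc) / (2 * A)])
    return [math.sqrt(lam) for lam in roots]


V = [[2.0, 0.0], [0.0, 0.5]]   # (1/2) m g b [[4, 0], [0, 1]] with m = g = b = 1
T = [[2.0, 0.5], [0.5, 0.5]]   # (1/2) m b^2 [[4, 1], [1, 1]] with m = b = 1
slow, fast = normal_frequencies_2dof(V, T)   # sqrt(2/3) and sqrt(2), in units of n
```

For more than two degrees of freedom the determinant becomes a polynomial of higher degree and a numerical root finder or eigenvalue routine would normally be used instead.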
FIGURE 15.3 Normal modes of the double pendulum. Left: The slow mode. Right: The fast
mode. (The angle $\epsilon$, which should be small, is made large for clarity.)


Question Form of the normal modes

What do the normal mode motions of the double pendulum look like?


Answer

To answer this we need to find the coordinate amplitudes in each of the normal modes.
If the amplitudes of $\theta$ and $\phi$ are $a_1$ and $a_2$ respectively, then these amplitudes satisfy
the linear equations

$$\begin{pmatrix} 4n^2 - 4\omega^2 & -\omega^2 \\ -\omega^2 & n^2 - \omega^2 \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \end{pmatrix} = 0.$$

Slow mode: When $\omega^2 = \omega_1^2 = 2n^2/3$, the equations for the amplitudes $a_1$, $a_2$
become, after simplification,

$$\begin{pmatrix} 2 & -1 \\ -2 & 1 \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \end{pmatrix} = 0.$$

Each of these equations is equivalent to the single equation $2a_1 = a_2$ so that we have
the family of non-trivial solutions $a_1 = \epsilon$, $a_2 = 2\epsilon$, where $\epsilon$ can take any (non-zero)
value. There is therefore just one slow normal mode. It has the form

$$\theta = \epsilon \cos\left(\sqrt{2/3}\, nt - \gamma\right), \qquad \phi = 2\epsilon \cos\left(\sqrt{2/3}\, nt - \gamma\right), \qquad (15.15)$$

where the amplitude factor $\epsilon$ and phase factor $\gamma$ can take any values.* This mode is
shown in Figure 15.3 (left).

Fast mode: In the fast mode, we have $\omega^2 = \omega_2^2 = 2n^2$ and, by following the same
procedure, we find that there is also one fast normal mode. It has the form

$$\theta = \epsilon \cos\left(\sqrt{2}\, nt - \gamma\right), \qquad \phi = -2\epsilon \cos\left(\sqrt{2}\, nt - \gamma\right), \qquad (15.16)$$

where the amplitude factor $\epsilon$ and phase factor $\gamma$ can take any values. This mode is
shown in Figure 15.3 (right). ■
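The amplitude ratios found in this answer can be reproduced with a few lines of code. The sketch below solves $(\mathbf{V} - \omega^2\mathbf{T}) \cdot \mathbf{a} = \mathbf{0}$ for a system with two degrees of freedom using the first equation only. The function name is illustrative, and the sample matrices again assume $m = g = b = 1$ so that $n^2 = 1$.

```python
def amplitude_vector_2dof(V, T, omega_squared):
    """Non-trivial solution of (V - omega^2 T) . a = 0, scaled so that a[0] = 1.

    Only the first row is used, so this assumes that omega_squared is a root of
    the determinantal equation and that the (0, 1) entry of V - omega^2 T is non-zero.
    """
    m00 = V[0][0] - omega_squared * T[0][0]
    m01 = V[0][1] - omega_squared * T[0][1]
    return [1.0, -m00 / m01]


V = [[2.0, 0.0], [0.0, 0.5]]
T = [[2.0, 0.5], [0.5, 0.5]]
slow_mode = amplitude_vector_2dof(V, T, 2.0 / 3.0)   # approximately [1, 2]
fast_mode = amplitude_vector_2dof(V, T, 2.0)         # [1, -2]
```

Because the equations are homogeneous, any non-zero multiple of the returned vector is an equally valid amplitude vector; the scaling $a_1 = 1$ is just a convenient choice.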


* However, the linearised theory is a good approximation only when $\epsilon$ is small.
15.4 EXISTENCE THEORY FOR NORMAL MODES

So far we have not said anything general about the number of normal mode
motions that a system possesses. This is related to the number of real positive roots of
the equation

$$\det(\mathbf{V} - \lambda\mathbf{T}) = 0. \qquad (15.17)$$

When expanded, this is a polynomial equation of degree $n$ in the variable $\lambda$, where $n$ is
the number of degrees of freedom of the system S. Such an equation always has $n$ roots
in the complex plane, but there seems to be no reason why any of them should be real,
let alone positive. In fact, all of the roots of equation (15.17) are real and positive. This
follows from generalised eigenvalue theory, which we will now develop.

Definition 15.5 Generalised eigenvalues and eigenvectors Let K and L be real $n \times n$
matrices. If there exists a number $\lambda$ and a (non-zero) column vector $\mathbf{x}$ such that

$$\mathbf{K} \cdot \mathbf{x} = \lambda \mathbf{L} \cdot \mathbf{x}, \qquad (15.18)$$

then $\lambda$ is said to be a generalised eigenvalue of the matrix K (with respect to the matrix
L) and $\mathbf{x}$ is a corresponding generalised eigenvector.*

The defining equation (15.18) can also be written

$$(\mathbf{K} - \lambda\mathbf{L}) \cdot \mathbf{x} = \mathbf{0}, \qquad (15.19)$$

which has a non-zero solution for $\mathbf{x}$ only when

$$\det(\mathbf{K} - \lambda\mathbf{L}) = 0. \qquad (15.20)$$

This is the equation satisfied by the eigenvalues. Provided that L is a non-singular matrix,
the eigenvalue equation is a polynomial equation of degree $n$ in $\lambda$, which has $n$ roots in
the complex plane. The more one knows about the matrices K and L, the more one can
say about their eigenvalues and eigenvectors.

Theorem 15.1 Eigenvalues of symmetric, positive definite matrices If K and L
are real symmetric matrices and L is positive definite,† then all the eigenvalues are real.
If the matrix K is also positive definite, then all the eigenvalues are positive.

Proof. Let $\mathbf{x}$ be any complex column vector and consider the scalar quantity $\bar{\mathbf{x}}' \cdot \mathbf{K} \cdot \mathbf{x}$, where $\bar{\mathbf{x}}$ is the
complex conjugate of $\mathbf{x}$. Then, since K is real and symmetric,

$$\overline{\bar{\mathbf{x}}' \cdot \mathbf{K} \cdot \mathbf{x}} = \mathbf{x}' \cdot \bar{\mathbf{K}} \cdot \bar{\mathbf{x}} = \mathbf{x}' \cdot \mathbf{K} \cdot \bar{\mathbf{x}} = (\mathbf{x}' \cdot \mathbf{K} \cdot \bar{\mathbf{x}})' = \bar{\mathbf{x}}' \cdot \mathbf{K}' \cdot \mathbf{x} = \bar{\mathbf{x}}' \cdot \mathbf{K} \cdot \mathbf{x}.$$


* Ordinary eigenvalues and eigenvectors correspond to the special case when L = 1, the identity matrix.

† A matrix A is called positive definite if its associated quadratic form $\mathbf{x}' \cdot \mathbf{A} \cdot \mathbf{x}$ is positive definite. Since
this condition is known to hold for V and T, both these matrices must be positive definite.



Suppose for example that the oscillating system has six degrees of freedom and that
the determinantal equation (15.14) is

$$(\Omega^2 - \omega^2)^2 (4\Omega^2 - \omega^2)^3 (25\Omega^2 - \omega^2) = 0,$$

after factorisation, where $\Omega$ is a positive constant. The normal frequencies are then $\omega_1 =
\Omega$ (double root), $\omega_2 = 2\Omega$ (triple root) and $\omega_3 = 5\Omega$ (simple root). There are therefore two
normal modes associated with the normal frequency $\omega_1$, three normal modes associated
with the normal frequency $\omega_2$, and one normal mode associated with the normal frequency
$\omega_3$. The total number of normal modes is six, which is equal to the number of degrees of
freedom of the system.

Definition 15.6 Degenerate frequencies If a normal frequency has more than one
normal mode associated with it, then that frequency is said to be degenerate.

In the example above, the normal frequencies $\omega_1$ and $\omega_2$ are degenerate, but $\omega_3$ is not.
The notion of degeneracy is important in quantum mechanics, where normal frequencies
correspond to the energies of stationary states. An unperturbed atom may have an energy
level $E$ that is (say) five-fold degenerate. When the atom is perturbed (by a magnetic field,
for example) the energies of the five states may be changed by differing amounts so that the
energy level is ‘split’ into five nearly equal levels. This is an important effect in the theory
of atomic spectra.


Existence of normal modes 

• For any oscillating system, the roots of the eigenvalue equation

$$\det(\mathbf{V} - \lambda\mathbf{T}) = 0$$

are all real and positive and their values are the squares of the normal frequencies
$\{\omega_j\}$ of the system.

• If $\omega_1^2$ is a simple root of the eigenvalue equation, then the equations

$$(\mathbf{V} - \omega_1^2\mathbf{T}) \cdot \mathbf{a} = \mathbf{0}$$

for the amplitude vector $\mathbf{a}$ have a non-trivial solution that is unique to within a
multiplicative constant. There is therefore only one normal mode associated with the
normal frequency $\omega_1$.

• More generally, if the root $\omega_1^2$ is repeated $k$ times, then the equations for the
amplitude vector $\mathbf{a}$ have $k$ linearly independent solutions. The normal frequency
$\omega_1$ is then degenerate with $k$ normal modes associated with it.

• In all cases, an oscillating system with $n$ degrees of freedom has a total of $n$
normal modes.
is modified to

$$V_1 = T_0\Delta_1 + \tfrac{1}{2}\alpha\Delta_1^2 = T_0\left( \frac{y_1^2}{2a} + \cdots \right) + \tfrac{1}{2}\alpha\left( \frac{y_1^2}{2a} + \cdots \right)^2 = \frac{T_0 y_1^2}{2a} + \cdots,$$

where the neglected terms have power four (or higher) in the small quantity $y_1$. Note
that, in the quadratic approximation, the spring constant $\alpha$ does not appear so that the
increase in tension of the string is negligible.

In the same way, the potential energies of the other three sections of string are
given by

$$V_2 = \frac{T_0 (y_2 - y_1)^2}{2a} + \cdots, \qquad V_3 = \frac{T_0 (y_3 - y_2)^2}{2a} + \cdots, \qquad V_4 = \frac{T_0 y_3^2}{2a} + \cdots,$$

and the total approximate potential energy is given by

$$V^{\mathrm{app}} = \frac{T_0 y_1^2}{2a} + \frac{T_0 (y_2 - y_1)^2}{2a} + \frac{T_0 (y_3 - y_2)^2}{2a} + \frac{T_0 y_3^2}{2a} = \frac{T_0}{2a}\left( 2y_1^2 + 2y_2^2 + 2y_3^2 - 2y_1 y_2 - 2y_2 y_3 \right).$$

The V-matrix is therefore

$$\mathbf{V} = \frac{T_0}{2a} \begin{pmatrix} 2 & -1 & 0 \\ -1 & 2 & -1 \\ 0 & -1 & 2 \end{pmatrix}.$$

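In this problem, the exact and approximate kinetic energies are the same, namely

$$T = T^{\mathrm{app}} = \tfrac{1}{2}(3m)\dot{y}_1^2 + \tfrac{1}{2}(4m)\dot{y}_2^2 + \tfrac{1}{2}(3m)\dot{y}_3^2,$$

so that the T-matrix is

$$\mathbf{T} = \tfrac{1}{2}m \begin{pmatrix} 3 & 0 & 0 \\ 0 & 4 & 0 \\ 0 & 0 & 3 \end{pmatrix}.$$

The eigenvalue equation $\det(\mathbf{V} - \omega^2\mathbf{T}) = 0$ can therefore be written

$$\begin{vmatrix} 2 - 3\mu & -1 & 0 \\ -1 & 2 - 4\mu & -1 \\ 0 & -1 & 2 - 3\mu \end{vmatrix} = 0,$$

where $\mu = ma\omega^2/T_0$. When expanded, this is the cubic equation

$$18\mu^3 - 33\mu^2 + 17\mu - 2 = 0$$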
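The roots of this cubic can be located numerically straight from the determinant, which also guards against slips in the expansion. The sketch below scans an interval for sign changes and refines each one by bisection. The function names, the scanning interval and the number of pieces are illustrative choices (the interval is chosen so that no grid point coincides with a root), not something prescribed by the text.

```python
def det3(a):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


def f(mu):
    """The determinant in the eigenvalue equation, as a function of mu."""
    return det3([[2 - 3 * mu, -1, 0],
                 [-1, 2 - 4 * mu, -1],
                 [0, -1, 2 - 3 * mu]])


def roots_by_bisection(func, lo, hi, pieces, iterations=60):
    """Scan [lo, hi] in equal pieces and bisect every sign change that is found."""
    roots = []
    step = (hi - lo) / pieces
    for i in range(pieces):
        left, right = lo + i * step, lo + (i + 1) * step
        if func(left) * func(right) < 0:
            for _ in range(iterations):
                mid = 0.5 * (left + right)
                if func(left) * func(mid) <= 0:
                    right = mid
                else:
                    left = mid
            roots.append(0.5 * (left + right))
    return roots


mu_values = roots_by_bisection(f, 0.0, 1.47, 21)   # three roots of the cubic
```

Each root $\mu$ then gives a normal frequency through $\omega^2 = \mu T_0/(ma)$, which follows from the definition of $\mu$ above.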
FIGURE 15.5 A simple classical model of a linear
symmetric molecule.


Example 15.5 The linear triatomic molecule 

A symmetric linear triatomic molecule is modelled by three particles connected by 
two springs, arranged as shown in Figure 15.5. Find the frequencies of rectilinear 
vibration of the molecule and the forms of the normal modes. 


Solution 


Let the centre atom have mass $m$, the outer atoms have mass $M$, and the springs have
constant $\alpha$. In this context, the spring constant is a measure of the ‘strength’ of the
chemical bond between the two atoms. (We will suppose that the interaction between
the two outer atoms is negligible.) Let the displacements of the three atoms from
their equilibrium positions be $x_1$, $x_2$, $x_3$ as shown in Figure 15.5. Then the kinetic
and potential energies of the molecule are given by

$$T = T^{\mathrm{app}} = \tfrac{1}{2}M\dot{x}_1^2 + \tfrac{1}{2}m\dot{x}_2^2 + \tfrac{1}{2}M\dot{x}_3^2,$$

$$V = V^{\mathrm{app}} = \tfrac{1}{2}\alpha(x_2 - x_1)^2 + \tfrac{1}{2}\alpha(x_3 - x_2)^2.$$

The T- and V-matrices are therefore

$$\mathbf{T} = \tfrac{1}{2} \begin{pmatrix} M & 0 & 0 \\ 0 & m & 0 \\ 0 & 0 & M \end{pmatrix}, \qquad \mathbf{V} = \tfrac{1}{2}\alpha \begin{pmatrix} 1 & -1 & 0 \\ -1 & 2 & -1 \\ 0 & -1 & 1 \end{pmatrix}.$$

Although everything looks normal, this problem has a non-standard feature in that the
potential energy of the molecule does not have a true minimum* at $x_1 = x_2 = x_3 = 0$.
This is actually clear from the start since the potential energy is unchanged if the 
whole molecule is translated to the right or left. Strictly speaking then, our theory 
does not apply to this problem, but, fortunately, only minor modifications are needed. 
In cases like this, it turns out that one or more of the normal frequencies is zero. 
These zero frequencies do not correspond to true normal modes. They correspond 
to uniform translational (or rotational) motions of the molecule as a whole. In the 
present case, the only uniform motion allowed is translational motion along the line 
of the molecule, so that we expect just one of the normal frequencies to be zero. 
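The expected zero frequency can be confirmed before any determinant is expanded. The sketch below builds the V-matrix read off from $V^{\mathrm{app}} = \tfrac{1}{2}\alpha(x_2 - x_1)^2 + \tfrac{1}{2}\alpha(x_3 - x_2)^2$ and applies it to the uniform translation $(1, 1, 1)$. A zero result means that $\lambda = 0$ satisfies $(\mathbf{V} - \lambda\mathbf{T}) \cdot \mathbf{a} = \mathbf{0}$ with this $\mathbf{a}$, whatever T is. The function names and the value of $\alpha$ are illustrative assumptions.

```python
def mat_vec(A, x):
    """Product A . x for a square matrix A (nested lists) and a vector x."""
    return [sum(A[j][k] * x[k] for k in range(len(x))) for j in range(len(A))]


def triatomic_v_matrix(alpha):
    """V-matrix read off from V_app = (alpha/2)[(x2 - x1)^2 + (x3 - x2)^2]."""
    half = 0.5 * alpha
    return [[half, -half, 0.0],
            [-half, 2 * half, -half],
            [0.0, -half, half]]


V = triatomic_v_matrix(alpha=3.0)
translation = [1.0, 1.0, 1.0]
print(mat_vec(V, translation))   # every component vanishes for a rigid translation
```

By contrast, a symmetric stretch such as $(-1, 0, 1)$ gives a non-zero result, so it costs potential energy and corresponds to a genuine vibration.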


* As a result, the V -matrix is not positive definite. 
Molecule    $\lambda_1^{-1}$ (cm$^{-1}$)    $\lambda_2^{-1}$ (cm$^{-1}$)    $\lambda_2^{-1}/\lambda_1^{-1}$    $(1 + 2\gamma)^{1/2}$

O-C-O       1337            2349            1.76            1.91

S-C-S       657             1532            2.33            2.52

Table 2 Vibrational frequencies of linear triatomic molecules;
comparison of theory and experiment.
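The last two columns of Table 2 can be reproduced with a short calculation. The sketch below assumes that $\gamma$ denotes the mass ratio $M/m$ of an outer atom to the centre atom, and it uses rounded relative atomic masses (C = 12, O = 16, S = 32); both are assumptions made for this illustration. The function name is also illustrative.

```python
import math

# Rounded relative atomic masses: assumed values supplied for this illustration.
MASS = {"C": 12.0, "O": 16.0, "S": 32.0}


def predicted_ratio(outer, centre):
    """(1 + 2*gamma)^(1/2) with gamma = M/m, outer-atom mass over centre-atom mass."""
    gamma = MASS[outer] / MASS[centre]
    return math.sqrt(1.0 + 2.0 * gamma)


# Measured wavenumbers (cm^-1) taken from Table 2.
measured = {"O-C-O": (1337.0, 2349.0), "S-C-S": (657.0, 1532.0)}

for name, (first, second) in measured.items():
    outer, centre = name[0], name[2]
    print(name, round(second / first, 2), round(predicted_ratio(outer, centre), 2))
```

The predicted ratios exceed the measured ones by roughly eight per cent in both cases, which gives a feel for how far this simple spring model can be trusted.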





FIGURE 15.6 A flat plate of general shape is suspended by three equal strings. 

potential energy. It is all the more surprising then that the following problem has a simple 
exact solution. 

Example 15.6 Plate supported by three strings 

A flat plate of general shape and general mass distribution is suspended in a horizontal
position by three vertical strings of equal length. Find the normal frequencies of small
oscillation.

Solution

The system is shown in its equilibrium position in Figure 15.6, top left. The three
strings are of length $l$ and are attached to the points $A$, $B$, $C$ of the plate. The point
$K$ (see Figure 15.6, top right) is the centre of the circle that passes through $A$, $B$, $C$.
Our initial choice of generalised coordinates is shown in Figure 15.6, bottom left.
$X_K$, $Y_K$ are the horizontal Cartesian* displacement components of the point $K$ from


* These can be any Cartesian axes $K_0xyz$ with $K_0z$ pointing vertically upwards; $\{\mathbf{i}, \mathbf{j}, \mathbf{k}\}$ are the
corresponding unit vectors.
its equilibrium position $K_0$, and $\theta$ is the rotation angle of the plate about the vertical
axis through $K_0$.* Three coordinates are sufficient since the three string constraints
reduce the number of degrees of freedom of the plate from six to three.

We will now calculate the potential energy of the plate in terms of the
coordinates $X_K$, $Y_K$ and $\theta$. This is the tricky step since the plate does not remain horizontal
and this complicates the geometry. However, the vertical displacement of any point of
the plate is quadratic in the small quantities $X_K$, $Y_K$, $\theta$ and, providing care is taken,
this enables us to use approximations. Consider first $\boldsymbol{\Delta}_A$, the displacement of the
point $A$. This is given approximately by

$$\boldsymbol{\Delta}_A = X_K\,\mathbf{i} + Y_K\,\mathbf{j} + (\theta\mathbf{k}) \times \mathbf{a} + \cdots,$$

correct to the first order in small quantities, where $\mathbf{a}$ is the position vector of $A$ relative
to $K_0$ in the equilibrium position. As expected, this displacement is horizontal, correct
to the first order in small quantities. The square of the magnitude of this horizontal
displacement is therefore

$$(X_K\,\mathbf{i} + Y_K\,\mathbf{j} + \theta\,\mathbf{k} \times \mathbf{a})^2 = X_K^2 + Y_K^2 + \theta^2 |\mathbf{k} \times \mathbf{a}|^2 + 2X_K\theta\,(\mathbf{i} \cdot (\mathbf{k} \times \mathbf{a})) + 2Y_K\theta\,(\mathbf{j} \cdot (\mathbf{k} \times \mathbf{a}))$$

$$= X_K^2 + Y_K^2 + R^2\theta^2 + 2Y_K\theta\,(\mathbf{a} \cdot \mathbf{i}) - 2X_K\theta\,(\mathbf{a} \cdot \mathbf{j}),$$

correct to the second order in small quantities, where $R$ $(= |\mathbf{a}|)$ is the radius of the circle
passing through $A$, $B$ and $C$. Since $A$ is one of the points that is suspended by a string
of length $l$, an application of Pythagoras shows that the vertical displacement of the
point $A$ is given by

$$z_A = \frac{X_K^2 + Y_K^2 + R^2\theta^2}{2l} + \left( \frac{2Y_K\theta}{2l} \right)(\mathbf{a} \cdot \mathbf{i}) - \left( \frac{2X_K\theta}{2l} \right)(\mathbf{a} \cdot \mathbf{j}),$$

correct to the second order in small quantities. Similar expressions exist for $z_B$ and
$z_C$ with $\mathbf{a}$ replaced by $\mathbf{b}$ and $\mathbf{c}$ respectively. Now for the clever bit. Since the plate is
flat, it follows that a general point of the plate with position vector $\mathbf{r}$ relative to $K_0$ in
the equilibrium position must have vertical displacement†

$$z = \frac{X_K^2 + Y_K^2 + R^2\theta^2}{2l} + \left( \frac{2Y_K\theta}{2l} \right)(\mathbf{r} \cdot \mathbf{i}) - \left( \frac{2X_K\theta}{2l} \right)(\mathbf{r} \cdot \mathbf{j}) \qquad (15.21)$$

in the displaced position, correct to the second order in small quantities. This
expression confirms that the plate is not generally horizontal in the displaced position.


* As we will see, the plate does not remain horizontal and so the angle $\theta$ ought to be defined more carefully.
Let $KP$ be any line fixed in the plate and let the projection of this line on to the equilibrium plane of the
plate be $K'P'$. The angle $\theta$ can be properly defined as the angle turned through by the line $K'P'$. This is
not quite the same as the angle turned through by the projection of some other line lying in the plate, but
the differences are quadratic in the small quantities $X_K$, $Y_K$ and $\theta$ and will turn out to be immaterial.

† This is because the expression (15.21) is linear in $\mathbf{r}$ and gives the correct $z$-values at $A$, $B$, $C$.
The purpose of all this is to calculate the potential energy $V = Mg z_G$, where $M$
is the mass of the plate and $z_G$ is the vertical displacement of the centre of mass $G$.

With no loss of generality we may take the axis $K_0x$ to point in the direction $K_0G_0$ so
that $\mathbf{r}_G = D\,\mathbf{i}$, where $D$ is the distance $KG$. The general formula (15.21) then shows
that

$$z_G = \frac{X_K^2 + Y_K^2 + R^2\theta^2}{2l} + \left( \frac{2Y_K\theta}{2l} \right) D = \frac{X_K^2 + (Y_K + D\theta)^2 + (R^2 - D^2)\theta^2}{2l},$$

correct to the second order in small quantities. Hence the approximate potential
energy is given by

$$V^{\mathrm{app}} = \frac{Mg}{2l}\left[ X_K^2 + (Y_K + D\theta)^2 + (R^2 - D^2)\theta^2 \right].$$

This formula can be simplified further by a change of generalised coordinates.
Let $X_G$, $Y_G$ be the horizontal Cartesian displacement components of the centre of
mass $G$ from its equilibrium position $G_0$, and $\theta_G$ be the rotation angle of the plate
about the vertical axis through $G_0$. Then, correct to the first order in small quantities,

$$X_G = X_K, \qquad Y_G = Y_K + D\theta, \qquad \theta_G = \theta.$$

In terms of the generalised coordinates $X_G$, $Y_G$, $\theta$ (we will drop the subscript
from $\theta_G$ from now on) the expression for $V^{\mathrm{app}}$ is

$$V^{\mathrm{app}} = \frac{Mg}{2l}\left( X_G^2 + Y_G^2 + (R^2 - D^2)\theta^2 \right),$$

and the V-matrix is

$$\mathbf{V} = \frac{Mg}{2l} \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & R^2 - D^2 \end{pmatrix},$$

a remarkably simple result in the end.

The approximate kinetic energy is calculated when the plate is passing through
its equilibrium position. This is simply

$$T^{\mathrm{app}} = \tfrac{1}{2}M\dot{X}_G^2 + \tfrac{1}{2}M\dot{Y}_G^2 + \tfrac{1}{2}I\dot{\theta}^2,$$

where $I$ is the moment of inertia of the plate about the axis through $G$ perpendicular
to its plane. If we write $I = Mk^2$, then the T-matrix becomes

$$\mathbf{T} = \tfrac{1}{2}M \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & k^2 \end{pmatrix}.$$
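Because both matrices here are diagonal, the equation $\det(\mathbf{V} - \omega^2\mathbf{T}) = 0$ factorises and each root is simply a ratio of corresponding diagonal entries. The sketch below evaluates those ratios. The function names and the numerical values are illustrative assumptions, and the code assumes $R > D$ so that the third root is positive.

```python
import math


def diagonal_normal_frequencies(v_diag, t_diag):
    """Normal frequencies when V and T are both diagonal: omega_j^2 = v_jj / t_jj."""
    return [math.sqrt(v / t) for v, t in zip(v_diag, t_diag)]


def plate_frequencies(g, l, R, D, k):
    """Frequencies for V = (Mg/2l) diag(1, 1, R^2 - D^2) and T = (M/2) diag(1, 1, k^2).

    The mass M cancels from every ratio, so it is not needed as an argument.
    """
    v_diag = [g / (2 * l), g / (2 * l), g * (R * R - D * D) / (2 * l)]
    t_diag = [0.5, 0.5, 0.5 * k * k]
    return diagonal_normal_frequencies(v_diag, t_diag)


# Sample values (SI units assumed): g = 9.81, l = 1, R = 0.5, D = 0.3, k = 0.2.
frequencies = plate_frequencies(9.81, 1.0, 0.5, 0.3, 0.2)
```

The first two frequencies coincide, so with these matrices the translational frequency $(g/l)^{1/2}$ is degenerate, while the third depends on the shape of the plate through $R$, $D$ and $k$.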
In normal mode theory, this result is of little consequence since the normal frequencies
are simply the roots of a polynomial equation which can always be solved numerically.
However, Rayleigh’s principle has extensions to many areas of applied mathematics and
physics such as continuum mechanics and quantum mechanics. In these subjects, the
oscillation problems often cannot be solved, even numerically, and Rayleigh’s principle is
one of the few ways in which information can be gained about the fundamental mode. For
example, in quantum mechanics Rayleigh’s minimum principle takes the form:

Suppose S is a quantum mechanical system with Hamiltonian $H$ and ground state
energy $E_1$. Then

$$\frac{\langle \chi | H | \chi \rangle}{\langle \chi | \chi \rangle} \ge E_1,$$

for any choice of the quantum state $\chi$.
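The classical counterpart of this inequality is easy to try out numerically. The sketch below assumes Rayleigh's principle in the usual form, namely that the quotient $(\mathbf{a}' \cdot \mathbf{V} \cdot \mathbf{a})/(\mathbf{a}' \cdot \mathbf{T} \cdot \mathbf{a})$ is never less than $\omega_1^2$, the square of the lowest normal frequency. It uses the double pendulum with $M = 3m$, $c = b$, with common factors removed and $n^2 = 1$, for which $\omega_1^2 = 2/3$. The function names and trial vectors are illustrative.

```python
def quadratic_form(A, x):
    """Evaluate x' . A . x for a square matrix A and a vector x."""
    n = len(x)
    return sum(x[j] * A[j][k] * x[k] for j in range(n) for k in range(n))


def rayleigh_quotient(V, T, a):
    """The quotient (a'.V.a)/(a'.T.a) for a non-zero trial amplitude vector a."""
    return quadratic_form(V, a) / quadratic_form(T, a)


# Double pendulum with M = 3m, c = b: V proportional to [[4, 0], [0, 1]] and
# T proportional to [[4, 1], [1, 1]]; with n^2 = g/b = 1 the quotient estimates omega^2.
V = [[4.0, 0.0], [0.0, 1.0]]
T = [[4.0, 1.0], [1.0, 1.0]]

for trial in ([1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, -2.0]):
    print(trial, rayleigh_quotient(V, T, trial))
```

The trial vector $(1, 2)$ is the slow-mode amplitude vector and attains the minimum value $2/3$. The other trial vectors give larger values, and $(1, -2)$ gives the fast-mode value $2$.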

15.7 GENERAL SMALL OSCILLATIONS 

Normal modes are special small motions but, from them, we can generate the 
general solution of the small oscillation equations. The result is as follows: 


General solution of small oscillation equations 

The general solution of the small oscillation equations can be expressed as a linear 
combination of normal modes in the form 

$$\mathbf{q}(t) = C_1\mathbf{a}_1\cos(\omega_1 t - \gamma_1) + C_2\mathbf{a}_2\cos(\omega_2 t - \gamma_2) + \cdots + C_n\mathbf{a}_n\cos(\omega_n t - \gamma_n)$$

where the amplitude factors $\{C_j\}$ and phase factors $\{\gamma_j\}$ are arbitrary constants. 
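
The following sketch illustrates the statement for a system with two degrees of freedom. It writes each mode as $\mathbf{a}_j(A_j\cos\omega_j t + B_j\sin\omega_j t)$ and chooses the coefficients to match given initial data. The amplitude vectors and normal frequencies used are assumptions: they are the values for a double pendulum with masses $3m$ (upper) and $m$ (lower) on strings of equal length $b$, and the numerical values of `g` and `b` and all function names are hypothetical.

```python
import math

g, b = 9.81, 1.0                      # assumed values (SI units)
modes = [(1.0, 2.0), (1.0, -2.0)]     # assumed amplitude vectors a_1, a_2
omegas = [math.sqrt(2 * g / (3 * b)), math.sqrt(2 * g / b)]  # assumed normal frequencies

def solve_2x2(c1, c2, rhs):
    """Solve x1*c1 + x2*c2 = rhs for (x1, x2) by Cramer's rule."""
    det = c1[0] * c2[1] - c2[0] * c1[1]
    x1 = (rhs[0] * c2[1] - c2[0] * rhs[1]) / det
    x2 = (c1[0] * rhs[1] - rhs[0] * c1[1]) / det
    return x1, x2

def general_solution(q0, qdot0):
    """Return the function q(t) with q(0) = q0 and qdot(0) = qdot0."""
    A = solve_2x2(modes[0], modes[1], q0)        # sum_j A_j a_j = q(0)
    Bw = solve_2x2(modes[0], modes[1], qdot0)    # sum_j (B_j w_j) a_j = qdot(0)
    B = [Bw[j] / omegas[j] for j in range(2)]

    def q(t):
        return tuple(
            sum(modes[j][i] * (A[j] * math.cos(omegas[j] * t)
                               + B[j] * math.sin(omegas[j] * t))
                for j in range(2))
            for i in range(2))
    return q

# Example initial data: displaced in the first coordinate, moving in the second.
q = general_solution(q0=(0.1, 0.0), qdot0=(0.0, 0.2))
```

The two linear systems have a unique solution because the amplitude vectors are linearly independent, so any initial data can be matched.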


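Proof. Suppose that $\mathbf{q}(t)$ is any solution of the small oscillation equations (15.11). We will now show 
that we can construct a linear combination of normal modes that satisfies the small oscillation equations 
and also satisfies the same initial conditions as $\mathbf{q}(t)$. To do this, take a general linear combination of normal 
modes in the form 

$$\begin{aligned}
\mathbf{q}^*(t) &= C_1\mathbf{a}_1\cos(\omega_1 t - \gamma_1) + C_2\mathbf{a}_2\cos(\omega_2 t - \gamma_2) + \cdots + C_n\mathbf{a}_n\cos(\omega_n t - \gamma_n) \\
&= \mathbf{a}_1(A_1\cos\omega_1 t + B_1\sin\omega_1 t) + \mathbf{a}_2(A_2\cos\omega_2 t + B_2\sin\omega_2 t) + \cdots + \mathbf{a}_n(A_n\cos\omega_n t + B_n\sin\omega_n t),
\end{aligned}$$

on writing $C_j\cos(\omega_j t - \gamma_j) = A_j\cos\omega_j t + B_j\sin\omega_j t$. Since the small oscillation equations are linear 
and homogeneous, $\mathbf{q}^*(t)$ is a solution for all choices of the coefficients $\{A_j\}$, $\{B_j\}$. We now need to choose 
the coefficients $\{A_j\}$, $\{B_j\}$ so that $\mathbf{q}^*(0) = \mathbf{q}(0)$ and $\dot{\mathbf{q}}^*(0) = \dot{\mathbf{q}}(0)$. This requires that the coefficients $\{A_j\}$ 
be chosen so that 

$$A_1\mathbf{a}_1 + A_2\mathbf{a}_2 + \cdots + A_n\mathbf{a}_n = \mathbf{q}(0),$$

and that the coefficients $\{B_j\}$ be chosen so that 

$$(B_1\omega_1)\mathbf{a}_1 + (B_2\omega_2)\mathbf{a}_2 + \cdots + (B_n\omega_n)\mathbf{a}_n = \dot{\mathbf{q}}(0).$$

Now consider a change of coordinates from $\mathbf{q}$ to $\boldsymbol{\eta}$ defined by the linear transformation* 

$$\mathbf{q} = \mathbf{P}\cdot\boldsymbol{\eta}, \qquad \boldsymbol{\eta} = \mathbf{P}^{-1}\cdot\mathbf{q} \tag{15.28}$$

where $\mathbf{P}$ can be any non-singular matrix. On substituting the transformation (15.28) into 
the expressions (15.27), we obtain 

$$\begin{aligned}
T^{\rm app} &= (\mathbf{P}\cdot\dot{\boldsymbol{\eta}})'\cdot\mathbf{T}\cdot(\mathbf{P}\cdot\dot{\boldsymbol{\eta}}) = \dot{\boldsymbol{\eta}}'\cdot(\mathbf{P}'\cdot\mathbf{T}\cdot\mathbf{P})\cdot\dot{\boldsymbol{\eta}}, \\
V^{\rm app} &= (\mathbf{P}\cdot\boldsymbol{\eta})'\cdot\mathbf{V}\cdot(\mathbf{P}\cdot\boldsymbol{\eta}) = \boldsymbol{\eta}'\cdot(\mathbf{P}'\cdot\mathbf{V}\cdot\mathbf{P})\cdot\boldsymbol{\eta},
\end{aligned}$$

from which we see that this transformation of coordinates causes $\mathbf{V}$ and $\mathbf{T}$ to be transformed as 

$$\mathbf{T} \to \mathbf{P}'\cdot\mathbf{T}\cdot\mathbf{P}, \qquad \mathbf{V} \to \mathbf{P}'\cdot\mathbf{V}\cdot\mathbf{P}. \tag{15.29}$$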


Can we now choose the transformation matrix P so that the new T - and V -matrices are 
diagonal? 

Let $\mathbf{a}_1, \mathbf{a}_2, \ldots, \mathbf{a}_n$ be the amplitude vectors of the normal modes when they are 
expressed in terms of the coordinates $\mathbf{q}$ and let $\omega_1, \omega_2, \ldots, \omega_n$ be the corresponding 
normal frequencies. We will suppose that these amplitude vectors have been chosen so 
that they satisfy the orthonormality relations (15.24), that is 

$$\mathbf{a}_j'\cdot\mathbf{T}\cdot\mathbf{a}_k = \begin{cases} 0 & (j \neq k), \\ 1 & (j = k). \end{cases} \tag{15.30}$$

Now consider the matrix $\mathbf{P}$ whose columns are the amplitude vectors $\{\mathbf{a}_j\}$, that is, 

$$\mathbf{P} = (\mathbf{a}_1 \,|\, \mathbf{a}_2 \,|\, \cdots \,|\, \mathbf{a}_n). \tag{15.31}$$


Since the amplitude vectors are known to be linearly independent, P has linearly 
independent columns and is therefore a non-singular matrix. Let us now try this P as the 
transformation matrix. Then 


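$$\mathbf{P}'\cdot\mathbf{T}\cdot\mathbf{P} = \begin{pmatrix} \mathbf{a}_1' \\ \mathbf{a}_2' \\ \vdots \\ \mathbf{a}_n' \end{pmatrix}\cdot\mathbf{T}\cdot(\mathbf{a}_1 \,|\, \mathbf{a}_2 \,|\, \cdots \,|\, \mathbf{a}_n).$$

* For example, in the case of two degrees of freedom, this transformation has the form 

$$q_1 = P_{11}\eta_1 + P_{12}\eta_2, \qquad q_2 = P_{21}\eta_1 + P_{22}\eta_2.$$

The $jk$-th element of this matrix is given by 

$$\mathbf{a}_j'\cdot\mathbf{T}\cdot\mathbf{a}_k = \begin{cases} 0 & (j \neq k) \\ 1 & (j = k) \end{cases}$$

by the orthonormality relations. Hence, with this choice of $\mathbf{P}$, 

$$\mathbf{P}'\cdot\mathbf{T}\cdot\mathbf{P} = \mathbf{1},$$

where $\mathbf{1}$ is the identity matrix. In the same way, 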


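$$\mathbf{P}'\cdot\mathbf{V}\cdot\mathbf{P} = \begin{pmatrix} \mathbf{a}_1' \\ \mathbf{a}_2' \\ \vdots \\ \mathbf{a}_n' \end{pmatrix}\cdot\mathbf{V}\cdot(\mathbf{a}_1 \,|\, \mathbf{a}_2 \,|\, \cdots \,|\, \mathbf{a}_n).$$

The $jk$-th element of this matrix is given by 

$$\mathbf{a}_j'\cdot\mathbf{V}\cdot\mathbf{a}_k = \mathbf{a}_j'\cdot(\mathbf{V}\cdot\mathbf{a}_k) = \mathbf{a}_j'\cdot\left(\omega_k^2\,\mathbf{T}\cdot\mathbf{a}_k\right) = \omega_k^2\left(\mathbf{a}_j'\cdot\mathbf{T}\cdot\mathbf{a}_k\right) = \begin{cases} 0 & (j \neq k), \\ \omega_j^2 & (j = k). \end{cases}$$

Hence, with this choice of $\mathbf{P}$, 

$$\mathbf{P}'\cdot\mathbf{V}\cdot\mathbf{P} = \boldsymbol{\Omega}^2,$$

where $\boldsymbol{\Omega}$ is the diagonal matrix whose diagonal elements are the normal frequencies, that 
is, 

$$\boldsymbol{\Omega} = \begin{pmatrix} \omega_1 & 0 & \cdots & 0 \\ 0 & \omega_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \omega_n \end{pmatrix}.$$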


We have thus succeeded in reducing both $\mathbf{V}$ and $\mathbf{T}$ to diagonal form. Hence the coordinates 
$\{\eta_j\}$ defined by (15.28) with $\mathbf{P} = (\mathbf{a}_1 \,|\, \mathbf{a}_2 \,|\, \cdots \,|\, \mathbf{a}_n)$ are a set of normal coordinates. They 
are given explicitly by 

$$\boldsymbol{\eta} = \mathbf{P}^{-1}\cdot\mathbf{q} = (\mathbf{P}'\cdot\mathbf{T})\cdot\mathbf{q},$$

on using the formula $\mathbf{P}'\cdot\mathbf{T}\cdot\mathbf{P} = \mathbf{1}$. This can also be written in the semi-expanded form 

$$\eta_j = (\mathbf{a}_j'\cdot\mathbf{T})\cdot\mathbf{q} \qquad (1 \leq j \leq n). \tag{15.32}$$

From this last formula, we can see that, if the amplitude vectors $\{\mathbf{a}_j\}$ are not normalised, 
then the coordinates $\{\eta_j\}$ are simply multiplied by constants. They are therefore still 
normal coordinates. The corresponding V- and T-matrices are still diagonal, but T is no 
longer reduced to the identity. 

Our results are summarised as follows: 


Finding normal coordinates 

Let $\mathbf{a}_1, \mathbf{a}_2, \ldots, \mathbf{a}_n$ be the amplitude vectors of the normal modes when expressed in 
terms of the coordinates $\{q_j\}$. Then the coordinates $\{\eta_j\}$ defined by 

$$\eta_j = (\mathbf{a}_j'\cdot\mathbf{T})\cdot\mathbf{q} \qquad (1 \leq j \leq n)$$

are a set of normal coordinates, as are any constant multiples of them. (The amplitude 
vectors only need to be normalised if it is required to reduce the matrix T to the 
identity.) 


When expressed in terms of normal coordinates, the small oscillation equations 
become 

$$\ddot{\boldsymbol{\eta}} + \boldsymbol{\Omega}^2\cdot\boldsymbol{\eta} = \mathbf{0}.$$

In expanded form, this is 

$$\ddot{\eta}_j + \omega_j^2\,\eta_j = 0 \qquad (1 \leq j \leq n),$$

a system of $n$ uncoupled SHM equations. The solution $\eta_1 = C_1\cos(\omega_1 t - \gamma_1)$, $\eta_2 = 
\eta_3 = \cdots = \eta_n = 0$ is the first normal mode, the solution $\eta_2 = C_2\cos(\omega_2 t - \gamma_2)$, 
$\eta_1 = \eta_3 = \cdots = \eta_n = 0$ is the second normal mode, and so on. 

Note. Using normal coordinates is not a practical way of solving normal mode problems. 
Indeed the problem has to be solved before the normal coordinates can be found! Normal 
coordinates are important because they simplify further developments of the general theory. 

Example 15.8 Finding normal coordinates 

Find a set of normal coordinates for the double pendulum problem. 

Solution 

For the double pendulum problem, we have already found that 




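Hence, on dropping the inessential constant factor $\tfrac{1}{2}mb^2$, a set of normal coordinates 
is given by 

$$\eta_1 = \begin{pmatrix} 1 & 2 \end{pmatrix}\cdot\begin{pmatrix} 4 & 1 \\ 1 & 1 \end{pmatrix}\cdot\begin{pmatrix} \theta \\ \phi \end{pmatrix} = 6\theta + 3\phi$$

and 

$$\eta_2 = \begin{pmatrix} 1 & -2 \end{pmatrix}\cdot\begin{pmatrix} 4 & 1 \\ 1 & 1 \end{pmatrix}\cdot\begin{pmatrix} \theta \\ \phi \end{pmatrix} = 2\theta - \phi.$$

Since normal coordinates may always be scaled, we can equally well take 

$$\eta_1 = 2\theta + \phi, \qquad \eta_2 = 2\theta - \phi,$$

as our normal coordinates. ■ 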
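
The reduction to diagonal form can be checked directly. The sketch below forms $\mathbf{a}_j'\cdot\mathbf{T}\cdot\mathbf{a}_k$ and $\mathbf{a}_j'\cdot\mathbf{V}\cdot\mathbf{a}_k$ for the double pendulum and confirms that the off-diagonal elements vanish. The matrices and amplitude vectors are assumptions (upper mass $3m$, lower mass $m$, equal string lengths $b$, with the constant factors $\tfrac{1}{2}mb^2$ and $\tfrac{1}{2}mgb$ dropped), and the helper names are hypothetical. The amplitude vectors are not normalised, so $\mathbf{P}'\cdot\mathbf{T}\cdot\mathbf{P}$ comes out diagonal but not equal to the identity.

```python
# Assumed double-pendulum data, with constant factors dropped.
T = [[4.0, 1.0], [1.0, 1.0]]      # T-matrix divided by (m b^2 / 2)
V = [[4.0, 0.0], [0.0, 1.0]]      # V-matrix divided by (m g b / 2)
a1, a2 = (1.0, 2.0), (1.0, -2.0)  # amplitude vectors (not normalised)

def mat_vec(M, x):
    """Matrix-vector product M . x for a 2x2 matrix."""
    return (M[0][0] * x[0] + M[0][1] * x[1],
            M[1][0] * x[0] + M[1][1] * x[1])

def dot(x, y):
    return x[0] * y[0] + x[1] * y[1]

def quad(x, M, y):
    """The bilinear form x' . M . y."""
    return dot(x, mat_vec(M, y))

# P has the amplitude vectors as its columns, so (P'.M.P)_jk = a_j' . M . a_k.
PtTP = [[quad(aj, T, ak) for ak in (a1, a2)] for aj in (a1, a2)]
PtVP = [[quad(aj, V, ak) for ak in (a1, a2)] for aj in (a1, a2)]

# Coefficients of (theta, phi) in eta_j = (a_j' . T) . q.
# Because T is symmetric, the row vector a_j' . T has the same entries as T . a_j.
eta_rows = [mat_vec(T, a) for a in (a1, a2)]

# Ratios of diagonal elements give omega^2 in units of g/b.
omega_sq = [PtVP[j][j] / PtTP[j][j] for j in range(2)]
```

With these assumed matrices, `eta_rows` reproduces the combinations $6\theta + 3\phi$ and $2\theta - \phi$ found above.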


Problems on Chapter 15 

Answers and comments are at the end of the book. 

Harder problems carry a star (*). 

Two degrees of freedom 

15.1 A particle P of mass $3m$ is connected to a particle Q of mass $8m$ by a light elastic spring 
of natural length $a$ and strength $\alpha$. Two similar springs are used to connect P and Q to the 
fixed points A and B respectively, which are a distance $3a$ apart on a smooth horizontal table. 
The particles can perform longitudinal oscillations along the straight line AB. Find the normal 
frequencies and the forms of the normal modes. 

The system is in equilibrium when the particle P receives a blow that gives it a speed $u$ 
in the direction AB. Find the displacement of each particle at time $t$ in the subsequent motion. 

15.2 A particle A of mass $3m$ is suspended from a fixed point O by a spring of strength $\alpha$ 
and a second particle B of mass $2m$ is suspended from A by a second identical spring. The 
system performs small oscillations in the vertical straight line through O. Find the normal 
frequencies, the forms of the normal modes, and a set of normal coordinates. 

15.3 Rod pendulum A uniform rod of length $2a$ is suspended from a fixed point O by a light 
inextensible string of length $b$ attached to one of its ends. The system moves in a vertical plane 
through O. Take as coordinates the angles $\theta$, $\phi$ between the string and the rod respectively 
and the downward vertical. Show that the equations governing small oscillations of the system 
about $\theta = \phi = 0$ are 

$$b\ddot{\theta} + a\ddot{\phi} = -g\theta,$$

$$b\ddot{\theta} + \tfrac{4}{3}a\ddot{\phi} = -g\phi.$$

For the special case in which $b = 4a/5$, find the normal frequencies and the forms of the 
normal modes. Is the general motion periodic? 

Three or more degrees of freedom 

15.4 Triple pendulum A triple pendulum has three strings of equal length $a$ and the three 
particles (starting from the top) have masses $6m$, $2m$, $m$ respectively. The pendulum performs 
small oscillations in a vertical plane. Show that the normal frequencies satisfy the equation 

$$12\mu^3 - 60\mu^2 + 81\mu - 27 = 0,$$

where $\mu = a\omega^2/g$. Find the normal frequencies, the forms of the normal modes, and a set of 
normal coordinates. [$\mu = 3$ is a root of the equation.] 

15.5 A light elastic string is stretched to tension $T_0$ between two fixed points A and B a 
distance $3a$ apart, and two particles of mass $m$ are attached to the string at equally spaced 
intervals. The strength of each of the three sections of the string is $\alpha$. The system performs 
small oscillations in a plane through AB. Without making any prior assumptions, prove that 
the particles oscillate longitudinally in two of the normal modes and transversely in the other 
two. Find the four normal frequencies. 

15.6 A rod of mass $M$ and length $L$ is suspended from two fixed points at the same horizontal 
level and a distance $L$ apart by two equal strings of length $b$ attached to its ends. From each 
end of the rod a particle of mass $m$ is suspended by a string of length $a$. The system of the rod 
and two particles performs small oscillations in a vertical plane. Find V and T for this system. 
For the special case in which $b = 3a/2$ and $M = 6m/5$, find the normal frequencies. Show 
that the general small motion is periodic and find the period. 

15.7 A uniform rod is suspended in a horizontal position by unequal vertical strings of lengths 
$b$, $c$ attached to its ends. Show that the frequency of the in-plane swinging mode is 
$\left((b + c)g/2bc\right)^{1/2}$, and that the frequencies of the other modes satisfy the equation 

$$bc\,\mu^2 - 2a(b + c)\mu + 3a^2 = 0,$$

where $\mu = a\omega^2/g$. Find the normal frequencies for the particular case in which $b = 3a$ and 
$c = 8a$. 

15.8* A uniform rod BC has mass $M$ and length $2a$. The end B of the rod is connected to a 
fixed point A on a smooth horizontal table by an elastic string of strength $\alpha_1$, and the end C is 
connected to a second fixed point D on the table by a second elastic string of strength $\alpha_2$. In 
equilibrium, the rod lies along the line AD with the strings having tension $T_0$ and lengths $b$, $c$ 
respectively. Show that the frequency of the longitudinal mode is $\left((\alpha_1 + \alpha_2)/M\right)^{1/2}$ and that 
the frequencies of the transverse modes satisfy the equation 

$$b^2c^2\mu^2 - 2bc(2ab + 3bc + 2ac)\mu + 6abc(2a + b + c) = 0,$$

where $\mu = Ma\omega^2/T_0$. [The calculation of $V^{\rm app}$ is very tricky.] 
Find the frequencies of the transverse modes for the particular case in which $a = 3c$ and 
$b = 5c$. 

15.9* A light elastic string is stretched between two fixed points A and B a distance $(n + 1)a$ 
apart, and $n$ particles of mass $m$ are attached to the string at equally spaced intervals. The 
strength of each of the $n + 1$ sections of the string is $\alpha$. The system performs small longitudinal 
oscillations along the line AB. Show that the normal frequencies satisfy the determinantal 
equation 

$$\Delta_n \equiv \begin{vmatrix}
2\cos\theta & -1 & 0 & \cdots & 0 & 0 \\
-1 & 2\cos\theta & -1 & \cdots & 0 & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & 0 & \cdots & 2\cos\theta & -1 \\
0 & 0 & 0 & \cdots & -1 & 2\cos\theta
\end{vmatrix} = 0,$$

where $\cos\theta = 1 - (m\omega^2/2\alpha)$. 

By expanding the determinant by the top row, show that $\Delta_n$ satisfies the recurrence relation 

$$\Delta_n = 2\cos\theta\,\Delta_{n-1} - \Delta_{n-2},$$

for $n \geq 3$. Hence, show by induction that 

$$\Delta_n = \sin(n + 1)\theta / \sin\theta.$$

Deduce the normal frequencies of the system. 

15.10 A light string is stretched to a tension $T_0$ between two fixed points A and B a distance 
$(n + 1)a$ apart, and $n$ particles of mass $m$ are attached to the string at equally spaced intervals. 
The system performs small plane transverse oscillations. Show that the normal frequencies 
satisfy the same determinantal equation as in the previous question, except that now $\cos\theta = 
1 - (ma\omega^2/2T_0)$. Find the normal frequencies of the system. 

Vibrating molecules 

15.11 Unsymmetrical linear molecule A general linear triatomic molecule has atoms $A_1$, 
$A_2$, $A_3$ with masses $m_1$, $m_2$, $m_3$. The chemical bond between $A_1$ and $A_2$ is represented by a 
spring of strength $\alpha_{12}$ and the bond between $A_2$ and $A_3$ is represented by a spring of strength 
$\alpha_{23}$. Show that the vibrational frequencies of the molecule satisfy the equation 

$$m_1m_2m_3\,\omega^4 - \left[\alpha_{12}m_3(m_1 + m_2) + \alpha_{23}m_1(m_2 + m_3)\right]\omega^2 + \alpha_{12}\alpha_{23}(m_1 + m_2 + m_3) = 0.$$

Find the vibrational frequencies for the special case in which $m_1 = 3m$, $m_2 = m$, $m_3 = 2m$ 
and $\alpha_{12} = 3\alpha$, $\alpha_{23} = 2\alpha$. 

B atom at the centroid. Show that the molecule has six vibrational modes of which five are 
in the plane of the molecule; show also that the out-of-plane mode and one of the in-plane 
modes have axial symmetry; and show finally that the remaining four in-plane modes are in 
doubly degenerate pairs. Deduce that the BCl$_3$ molecule has a total of four distinct vibrational 
frequencies. 

Computer assisted problem 

15.14 Sulphur dioxide molecule Use computer assistance to obtain an equation satisfied by 
the squares of the frequencies of the symmetric modes of a V-shaped molecule. For the special 
case in which $M = 2m$ and $\alpha = 60°$, show that the frequencies of the symmetrical modes 
satisfy the equation 

$$4\mu^2 - (5 + 8\epsilon)\mu + 4\epsilon = 0,$$

where $\mu = m\omega^2/k$. 

The sulphur dioxide molecule O-S-O has mass ratio $M/m = 2$ and an apex angle 
very close to 120°. Its infrared absorption wave numbers are found to be $\lambda_1^{-1} = 1151$ cm$^{-1}$, 
$\lambda_2^{-1} = 525$ cm$^{-1}$, $\lambda_3^{-1} = 1336$ cm$^{-1}$. Show that there is no value of $\epsilon$ that fits this data with 
reasonable accuracy. This is a deficiency of our simple (central force) model of interatomic 
forces, which gives poor results for V-shaped molecules (see Herzberg [13]). 




Chapter Sixteen 




Vector angular velocity 
and rigid body kinematics 


KEY FEATURES 

The key features in this chapter are vector angular velocity and the kinematics of rigid bodies 
in general motion. 


This chapter is concerned with the kinematics of rigid bodies in general motion. 
In Chapter 2 we considered only those rigid body motions that were essentially two- 
dimensional, and angular velocity appeared there as a scalar quantity. In general three- 
dimensional rigid body motion, this approach is no longer adequate and angular velocity 
must be introduced in its proper role as a vector quantity. The principal result of the 
Chapter is that any motion of a rigid body can be represented as a sum of translational 
and rotational contributions. 

16.1 ROTATION ABOUT A FIXED AXIS 

In this chapter we adopt a more rigorous approach to rigid body rotation than we 
did in Chapter 2. We begin with a proper definition of rigidity. 

Definition 16.1 Rigidity A body B is said to be a rigid body if the distance between 
any pair of its particles remains constant. That is, if $P_i$ and $P_j$ are typical particles of B 
with position vectors $\mathbf{r}_i(t)$ and $\mathbf{r}_j(t)$ at time $t$, then 

$$|\mathbf{r}_i(t) - \mathbf{r}_j(t)| = c_{ij}, \tag{16.1}$$

where the $c_{ij}$ are constants. 

Suppose a rigid body B is rotating about a fixed axis with angular speed $\omega$. This 
motion certainly satisfies the rigidity conditions (16.1). Let $\mathbf{n}$ be a unit vector parallel to 
the rotation axis. Then the vector angular velocity of B is defined as follows: 

Definition 16.2 Vector angular velocity The angular velocity vector of the body B is 
defined to be 

$$\boldsymbol{\omega} = \pm\,\omega\,\mathbf{n}, \tag{16.2}$$

where the sign is taken to be plus or minus depending on whether the sense of the rotation 
(relative to the vector $\mathbf{n}$) is right- or left-handed. These senses are shown in Figure 16.1. 

FIGURE 16.1 The rigid body B rotates with angular speed $\omega$ about a fixed 
axis parallel to the unit vector $\mathbf{n}$. Its angular velocity vector is defined by 
$\boldsymbol{\omega} = \pm\,\omega\,\mathbf{n}$, where the sign is determined by the sense of the rotation. 

Note. From the vector $\boldsymbol{\omega}$ we can deduce the angular speed, the axis direction, and the 
rotation sense about the axis. It tells us nothing about the position of the axis either in 
space or in the body. 

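Example 16.1 Calculation of $\boldsymbol{\omega}$ 

A rigid body B is rotating with angular speed 7 radians per second about a fixed axis 
through the points $A(2, 3, -1)$, $B(-4, 0, 1)$. The rotation is in the left-handed sense 
relative to $\overrightarrow{AB}$. Find the angular velocity of B. 

Solution 

The position vectors of the points A and B are $\mathbf{a} = 2\mathbf{i} + 3\mathbf{j} - \mathbf{k}$ and $\mathbf{b} = -4\mathbf{i} + \mathbf{k}$ so 
that 

$$\overrightarrow{AB} = \mathbf{b} - \mathbf{a} = -6\mathbf{i} - 3\mathbf{j} + 2\mathbf{k}.$$

The vector $\mathbf{n}$ is then 

$$\mathbf{n} = \frac{-6\mathbf{i} - 3\mathbf{j} + 2\mathbf{k}}{|-6\mathbf{i} - 3\mathbf{j} + 2\mathbf{k}|} = \frac{-6\mathbf{i} - 3\mathbf{j} + 2\mathbf{k}}{7}.$$

The rotation sense is left-handed relative to the direction of $\mathbf{n}$ so that the angular 
velocity of B is 

$$\boldsymbol{\omega} = (-7)\,\mathbf{n} = (-7)\left(\frac{-6\mathbf{i} - 3\mathbf{j} + 2\mathbf{k}}{7}\right) = 6\mathbf{i} + 3\mathbf{j} - 2\mathbf{k} \ \text{radians per second.} \ ■$$

Particle velocities and accelerations 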

The particle velocities can be conveniently calculated in terms of the vector $\boldsymbol{\omega}$. Let P be 
a particle of B with position vector $\mathbf{r}$ relative to an origin O located on the rotation axis 
(see Figure 16.1). Then the velocity of P has the same direction as the vector $\boldsymbol{\omega}\times\mathbf{r}$. Hence 
$\mathbf{v}$ can be written in the form 

$$\mathbf{v} = \lambda\,\boldsymbol{\omega}\times\mathbf{r},$$

where $\lambda$ is a positive scalar. To determine $\lambda$, consider the magnitude of each side. The 
magnitude of $\mathbf{v}$ is the circumferential speed $\omega\rho$ (see Figure 16.1) and so 

$$\omega\rho = \lambda\,|\boldsymbol{\omega}\times\mathbf{r}| = \lambda\,|\boldsymbol{\omega}||\mathbf{r}|\sin\alpha = \lambda\,\omega\,(OP\sin\alpha) = \lambda\,\omega\rho.$$

Hence $\lambda = 1$ and the formula for $\mathbf{v}$ is 

$$\mathbf{v} = \boldsymbol{\omega}\times\mathbf{r}. \tag{16.3}$$

This formula applies only when the origin of vectors lies on the rotation axis, but a 
more general result is easy to obtain. Let B be any fixed point on the rotation axis. Then 
the velocity formula (16.3) still holds if the position vector $\mathbf{r}$ is replaced by $\overrightarrow{BP} = \mathbf{r} - \mathbf{b}$, 
where $\mathbf{b}$ is the position vector of the point B. Hence the general formula for the velocity 
of P is given by 

$$\mathbf{v} = \boldsymbol{\omega}\times(\mathbf{r} - \mathbf{b}) \tag{16.4}$$

where B is any point on the rotation axis. 

Example 16.2 Finding particle velocities and accelerations 

A rigid body is rotating with constant angular speed 7 radians per second about a 
fixed axis through the points $A(2, 3, -1)$, $B(-4, 0, 1)$, distances being measured in 
centimetres. The rotation is in the left-handed sense relative to $\overrightarrow{AB}$. Find the 
instantaneous velocity, speed, and acceleration of the particle P of the body at the point 
$(-3, 3, 5)$. 

Solution 

The angular velocity of this body has been determined in the last example to be 
$\boldsymbol{\omega} = 6\mathbf{i} + 3\mathbf{j} - 2\mathbf{k}$ radians per second. 

The velocity of P can now be found using (16.4) with 

$$\mathbf{r} = -3\mathbf{i} + 3\mathbf{j} + 5\mathbf{k} \quad\text{and}\quad \mathbf{b} = -4\mathbf{i} + \mathbf{k}.$$

This gives 

$$\mathbf{v} = (6\mathbf{i} + 3\mathbf{j} - 2\mathbf{k})\times(\mathbf{i} + 3\mathbf{j} + 4\mathbf{k}) = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ 6 & 3 & -2 \\ 1 & 3 & 4 \end{vmatrix} = 18\mathbf{i} - 26\mathbf{j} + 15\mathbf{k} \ \text{cm s}^{-1}.$$

This is the instantaneous velocity of P. The speed is therefore $|\mathbf{v}| = 
\left(18^2 + (-26)^2 + 15^2\right)^{1/2} = 35$ cm s$^{-1}$. 

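The acceleration of P can be found by differentiating the formula (16.4) with 
respect to $t$. This gives 

$$\mathbf{a} = \dot{\boldsymbol{\omega}}\times(\mathbf{r} - \mathbf{b}) + \boldsymbol{\omega}\times(\dot{\mathbf{r}} - \dot{\mathbf{b}}).$$

But $\boldsymbol{\omega}$ is known to have constant direction and magnitude and so $\dot{\boldsymbol{\omega}} = \mathbf{0}$. Also $\dot{\mathbf{b}} = \mathbf{0}$ 
since B is a fixed particle. This leaves 

$$\begin{aligned}
\mathbf{a} &= \boldsymbol{\omega}\times\dot{\mathbf{r}} = \boldsymbol{\omega}\times\mathbf{v} \\
&= (6\mathbf{i} + 3\mathbf{j} - 2\mathbf{k})\times(18\mathbf{i} - 26\mathbf{j} + 15\mathbf{k}) \\
&= \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ 6 & 3 & -2 \\ 18 & -26 & 15 \end{vmatrix} \\
&= -7\mathbf{i} - 126\mathbf{j} - 210\mathbf{k} \ \text{cm s}^{-2}. \ ■
\end{aligned}$$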
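
The arithmetic of the last two examples can be reproduced with a few lines of code. The sketch below uses only the data given in the examples; the helper functions `cross`, `sub` and `norm` are hypothetical names introduced for illustration. It also evaluates formula (16.4) with the point A in place of B, to illustrate that any point on the rotation axis gives the same velocity.

```python
import math

def cross(u, v):
    """Cross product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def sub(u, v):
    return tuple(p - q for p, q in zip(u, v))

def norm(u):
    return math.sqrt(sum(c * c for c in u))

A = (2, 3, -1)       # point on the axis
B = (-4, 0, 1)       # second point on the axis
P = (-3, 3, 5)       # particle position (cm)
angular_speed = 7    # radians per second

AB = sub(B, A)
n = tuple(c / norm(AB) for c in AB)            # unit vector along AB
omega = tuple(-angular_speed * c for c in n)   # minus sign: left-handed sense

v = cross(omega, sub(P, B))      # velocity from (16.4) with B as axis point
v_alt = cross(omega, sub(P, A))  # same formula with A as axis point
speed = norm(v)
a = cross(omega, v)              # acceleration, valid because omega is constant
```

Both choices of axis point give the same velocity because $\mathbf{b} - \mathbf{a}$ is parallel to $\boldsymbol{\omega}$, so its cross product with $\boldsymbol{\omega}$ vanishes.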

16.2 GENERAL RIGID BODY KINEMATICS 

We now move on to the more general case in which the rigid body does not have 
a fixed rotation axis. We first consider a rigid body that has one particle O that does not 
move, and we take O to be the origin of position vectors. Now the rigidity conditions 
(16.1) are equivalent to 

$$(\mathbf{r}_i - \mathbf{r}_j)\cdot(\mathbf{r}_i - \mathbf{r}_j) = c_{ij}^2 \tag{16.5}$$

and because O is a particle of B it follows in particular that 

$$\mathbf{r}_i\cdot\mathbf{r}_i = d_i^2, \tag{16.6}$$

where the $d_i$ are constants. On expanding the dot product in (16.5) and using (16.6) we 
obtain 

$$\mathbf{r}_i\cdot\mathbf{r}_j = e_{ij}, \tag{16.7}$$

where the $e_{ij}$ are constants. If we now differentiate (16.7) with respect to $t$ we obtain 

$$\dot{\mathbf{r}}_i\cdot\mathbf{r}_j + \mathbf{r}_i\cdot\dot{\mathbf{r}}_j = 0 \quad\text{for all } i, j, \tag{16.8}$$

which is our preferred form of the rigidity conditions. 

We now prove the fundamental theorems of rigid body kinematics. The details of 
the proofs are mainly of interest to mathematics students. 

Theorem 16.1 Existence of angular velocity I Let a rigid body B be in motion with 
one of its particles O fixed. Then there exists a unique vector $\boldsymbol{\omega}(t)$ such that the velocity 
of any particle P of B is given by the formula 

$$\mathbf{v} = \boldsymbol{\omega}\times\mathbf{r}, \tag{16.9}$$

where $\mathbf{r}$ is the position vector of P relative to O. 

This result means that, at each instant, B is rotating about an instantaneous axis 
through O. This axis is not fixed in space or in the body. 

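Proof. Suppose that there exist particles $E_1$, $E_2$, $E_3$ of B such that their position vectors 
$\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ relative to O form a standard basis set. Then if there does exist an $\boldsymbol{\omega}$ satisfying 
(16.9), it must in particular satisfy 

$$\dot{\mathbf{e}}_k = \boldsymbol{\omega}\times\mathbf{e}_k \tag{16.10}$$

for $k = 1, 2, 3$. Taking the cross product of this equation with $\mathbf{e}_k$ gives 

$$\mathbf{e}_k\times\dot{\mathbf{e}}_k = \mathbf{e}_k\times(\boldsymbol{\omega}\times\mathbf{e}_k) = (\mathbf{e}_k\cdot\mathbf{e}_k)\,\boldsymbol{\omega} - (\boldsymbol{\omega}\cdot\mathbf{e}_k)\,\mathbf{e}_k = \boldsymbol{\omega} - (\boldsymbol{\omega}\cdot\mathbf{e}_k)\,\mathbf{e}_k$$

since $\mathbf{e}_k$ is a unit vector. Summing these equations over $1 \leq k \leq 3$ gives 

$$\sum_k \mathbf{e}_k\times\dot{\mathbf{e}}_k = 3\boldsymbol{\omega} - \sum_k (\boldsymbol{\omega}\cdot\mathbf{e}_k)\,\mathbf{e}_k = 3\boldsymbol{\omega} - \boldsymbol{\omega}$$

since the sum on the right is just the expansion of $\boldsymbol{\omega}$ with respect to the basis set $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$. 
Hence 

$$\boldsymbol{\omega} = \tfrac{1}{2}\sum_k \mathbf{e}_k\times\dot{\mathbf{e}}_k. \tag{16.11}$$

Thus if $\boldsymbol{\omega}$ does exist, it must be given by the formula (16.11), which shows that $\boldsymbol{\omega}$ is unique. 
We must now show that this $\boldsymbol{\omega}$ satisfies (16.9) for all the particles of the body. It is a simple 
exercise to verify that this is true for the particles $E_1$, $E_2$, $E_3$ by substituting (16.11) into 
(16.10) and using the rigidity conditions (16.8). Now let P be any other particle of the body 
and expand its position vector $\mathbf{r}$ with respect to the basis set $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ in the form 

$$\mathbf{r} = \sum_k (\mathbf{r}\cdot\mathbf{e}_k)\,\mathbf{e}_k.$$

The velocity of P is then given by 

$$\mathbf{v} = \dot{\mathbf{r}} = \sum_k (\dot{\mathbf{r}}\cdot\mathbf{e}_k + \mathbf{r}\cdot\dot{\mathbf{e}}_k)\,\mathbf{e}_k + \sum_k (\mathbf{r}\cdot\mathbf{e}_k)\,\dot{\mathbf{e}}_k$$

FIGURE 16.2 A rigid body in general motion. 
The velocities of its particles are the sum of 
those due to a translation with velocity $\mathbf{v}^B$ 
and a rotation with angular velocity $\boldsymbol{\omega}$ about 
an axis through B. 

through B, where the particle B can be any particle of the body. This is shown in Figure 
16.2. 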

Now suppose we choose a new reference particle C. Then the translational velocity 
would become $\mathbf{v}^C$, but what happens to the angular velocity $\boldsymbol{\omega}$? The answer is nothing; the 
angular velocity $\boldsymbol{\omega}$ is independent of the choice of reference particle. This means that we 
can refer to the angular velocity of a rigid body without specifying the reference particle. 
The proof of this is as follows: 

Proof. Suppose that, with reference particles B, C, the body has angular velocities $\boldsymbol{\omega}^B$, $\boldsymbol{\omega}^C$ respectively. 
Then the velocity $\mathbf{v}$ of any particle P of the body is given by either of the two formulae 

$$\mathbf{v} = \mathbf{v}^B + \boldsymbol{\omega}^B\times(\mathbf{r} - \mathbf{b}),$$
$$\mathbf{v} = \mathbf{v}^C + \boldsymbol{\omega}^C\times(\mathbf{r} - \mathbf{c}),$$

where $\mathbf{r}$ is the position vector of P. It follows that 

$$\mathbf{v}^B + \boldsymbol{\omega}^B\times(\mathbf{r} - \mathbf{b}) = \mathbf{v}^C + \boldsymbol{\omega}^C\times(\mathbf{r} - \mathbf{c}) \tag{16.16}$$

for any $\mathbf{r}$ that is the position vector of a particle of the body. In particular, since B and C are particles of the 
body, it follows that 

$$\mathbf{v}^B = \mathbf{v}^C + \boldsymbol{\omega}^C\times(\mathbf{b} - \mathbf{c}),$$
$$\mathbf{v}^B + \boldsymbol{\omega}^B\times(\mathbf{c} - \mathbf{b}) = \mathbf{v}^C,$$

and if we now subtract each of these formulae from the equality (16.16), we obtain 

$$\mathbf{x}\times(\mathbf{r} - \mathbf{b}) = \mathbf{0},$$
$$\mathbf{x}\times(\mathbf{r} - \mathbf{c}) = \mathbf{0},$$

where $\mathbf{x} = \boldsymbol{\omega}^B - \boldsymbol{\omega}^C$. Now let P be any particle of the body not collinear with B and C and suppose that 
$\mathbf{x} \neq \mathbf{0}$. Then $\mathbf{x}$ must be parallel to both of the vectors $\mathbf{r} - \mathbf{b}$ and $\mathbf{r} - \mathbf{c}$, which are not parallel to each other. 
This is impossible and so $\mathbf{x} = \mathbf{0}$, which means that $\boldsymbol{\omega}^B = \boldsymbol{\omega}^C$. Hence the angular velocity of the body is 
the same, irrespective of the choice of reference particle. ■ 

Our results are summarised below: 


Particle velocities in a rigid body 

Suppose a rigid body is in general motion and that B is one of its particles. Then the 
velocity of any particle P of the body is given by the formula 

$$\mathbf{v} = \mathbf{v}^B + \boldsymbol{\omega}\times(\mathbf{r} - \mathbf{b}), \tag{16.17}$$

where $\mathbf{v}^B$ is the velocity of the reference particle B and the angular velocity $\boldsymbol{\omega}$ is 
independent of the choice of reference particle. 
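
The summary can be illustrated numerically. The sketch below picks an arbitrary angular velocity and reference particle (all numerical values and function names are hypothetical), computes the velocity of a second particle C from the formula, and then checks that using C as the reference particle with the same $\boldsymbol{\omega}$ gives the same velocity for a third particle. It also checks the formula $\boldsymbol{\omega} = \tfrac{1}{2}\sum_k \mathbf{e}_k\times\dot{\mathbf{e}}_k$ from the proof of Theorem 16.1, taking $\dot{\mathbf{e}}_k = \boldsymbol{\omega}\times\mathbf{e}_k$ for the standard basis vectors.

```python
def cross(u, v):
    """Cross product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def add(u, v):
    return tuple(p + q for p, q in zip(u, v))

def sub(u, v):
    return tuple(p - q for p, q in zip(u, v))

omega = (0.3, -1.2, 2.0)    # hypothetical angular velocity of the body
b = (1.0, 0.0, 2.0)         # hypothetical position of reference particle B
v_B = (0.5, 0.5, -1.0)      # hypothetical velocity of B
c = (-2.0, 1.0, 0.0)        # hypothetical position of a second particle C

def velocity(r, ref, v_ref):
    """Velocity of the particle at r: v = v_ref + omega x (r - ref)."""
    return add(v_ref, cross(omega, sub(r, ref)))

v_C = velocity(c, b, v_B)   # velocity of C, computed with B as reference

r = (0.7, -0.4, 1.5)        # hypothetical position of a third particle P
v_from_B = velocity(r, b, v_B)
v_from_C = velocity(r, c, v_C)   # same omega, different reference particle

# Recover omega from the motion of three orthonormal body vectors.
basis = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
total = (0.0, 0.0, 0.0)
for e in basis:
    e_dot = cross(omega, e)             # rate of change of e_k
    total = add(total, cross(e, e_dot))
omega_recovered = tuple(0.5 * t for t in total)
```

The two velocities agree because the difference between them is $\boldsymbol{\omega}\times(\mathbf{c} - \mathbf{b}) + \boldsymbol{\omega}\times(\mathbf{r} - \mathbf{c}) - \boldsymbol{\omega}\times(\mathbf{r} - \mathbf{b})$, which is identically zero.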


Example 16.3 Rolling snooker ball 


A rigid ball of radius $b$ rolls without slipping on a horizontal table. Find the most 
general form of $\boldsymbol{\omega}$ consistent with the rolling condition. 

Solution 

Suppose the ball is rolling with velocity $V\mathbf{i}$ (where $\mathbf{i}$ is a horizontal unit vector), and 
has an unknown angular velocity $\boldsymbol{\omega}$. Then, on taking the centre C of the ball as the 
reference particle, the velocity of any particle P is given by (16.15) to be 

$$\mathbf{v} = V\mathbf{i} + \boldsymbol{\omega}\times(\mathbf{r} - \mathbf{c}).$$

In particular, the velocity of the contact particle Q is given by 

$$\mathbf{v}^Q = V\mathbf{i} + \boldsymbol{\omega}\times(-b\,\mathbf{k}),$$

where the unit vector $\mathbf{k}$ points vertically upwards. Since the rolling condition requires 
that $\mathbf{v}^Q = \mathbf{0}$, it follows that $\boldsymbol{\omega}$ must satisfy the condition 

$$V\mathbf{i} + b\,\mathbf{k}\times\boldsymbol{\omega} = \mathbf{0}. \tag{16.18}$$

On taking the cross product of this equation with $\mathbf{k}$, we obtain 

$$V\,\mathbf{k}\times\mathbf{i} + b\,\mathbf{k}\times(\mathbf{k}\times\boldsymbol{\omega}) = \mathbf{0},$$

that is 

$$V\mathbf{j} + b\left((\boldsymbol{\omega}\cdot\mathbf{k})\,\mathbf{k} - (\mathbf{k}\cdot\mathbf{k})\,\boldsymbol{\omega}\right) = \mathbf{0}.$$

Since $\mathbf{k}$ is a unit vector, $\mathbf{k}\cdot\mathbf{k} = 1$ and we obtain 

$$\boldsymbol{\omega} = \frac{V}{b}\,\mathbf{j} + (\boldsymbol{\omega}\cdot\mathbf{k})\,\mathbf{k}.$$

It follows that any $\boldsymbol{\omega}$ consistent with the rolling condition must have the form 

$$\boldsymbol{\omega} = \frac{V}{b}\,\mathbf{j} + \lambda\,\mathbf{k}, \tag{16.19}$$

where $\lambda$ is a scalar function of the time. Conversely, it is easy to verify that the formula 
(16.19) for $\boldsymbol{\omega}$ satisfies the rolling condition (16.18) for any choice of the scalar $\lambda$. This 
is therefore the most general form of $\boldsymbol{\omega}$ consistent with rolling. 

This result is surprising at first. If the motion were planar, the value of $\omega$ would 
be $V/b$, the corresponding $\boldsymbol{\omega}$ being $(V/b)\,\mathbf{j}$. This is the special case $\lambda = 0$. But in 
general three dimensional rolling, $\lambda \neq 0$ and the rotation axis is not horizontal. This 
effect is well known to pool and snooker players and is achieved by striking the ball 
to the right (or left) of centre, thereby giving $\lambda$ a positive (or negative) value. Players 
call this putting ‘side’ on the ball. It makes no difference to the rolling but affects the 
bounce when the ball strikes a cushion. ■ 
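
The converse statement, that an angular velocity of the form $(V/b)\,\mathbf{j} + \lambda\,\mathbf{k}$ satisfies the rolling condition for every $\lambda$, can be checked numerically. The sketch below evaluates the velocity of the contact particle, $V\mathbf{i} + \boldsymbol{\omega}\times(-b\,\mathbf{k})$, for several values of $\lambda$. The numerical values and the function name `contact_velocity` are hypothetical; the optional argument `extra_i` adds a component along $\mathbf{i}$ to show that such an $\boldsymbol{\omega}$ violates the condition.

```python
def cross(u, v):
    """Cross product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def contact_velocity(V, b, lam, extra_i=0.0):
    """Velocity of the contact particle for omega = extra_i*i + (V/b)*j + lam*k.

    Components are taken along (i, j, k), with k vertically upwards.
    """
    omega = (extra_i, V / b, lam)
    w = cross(omega, (0.0, 0.0, -b))       # omega x (-b k)
    return (V + w[0], w[1], w[2])          # V i + omega x (-b k)

V, b = 1.5, 0.026   # hypothetical speed (m/s) and ball radius (m)

# Rolling holds for every value of lambda ...
residuals = [contact_velocity(V, b, lam) for lam in (-40.0, 0.0, 12.5, 300.0)]

# ... but fails if omega has any component along i.
slipping = contact_velocity(V, b, lam=0.0, extra_i=2.0)
```

The component $\lambda\,\mathbf{k}$ drops out because $\mathbf{k}\times\mathbf{k} = \mathbf{0}$, which is why the spin about the vertical is left undetermined by the rolling condition.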

Time to relax Find a pool table and experiment by striking a ball slowly but firmly well 
to the right of centre. The marking on the ball should enable you to ‘see’ the rotation axis 
(a ball with spots is best). Check that giving the ball ‘right hand side’ produces a positive 
value of $\boldsymbol{\omega}\cdot\mathbf{k}$. 

Example 16.4 Wheel rolling around a circular path 

A circular wheel of radius b has its plane vertical and rolls with constant speed V 
around a circular path of radius R marked on a horizontal floor. Find the angular 
velocity of the wheel and the acceleration of the contact particle. 

Solution 

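Suppose we view the motion of the wheel from the rotating reference frame 
$\{O;\, \mathbf{i}, \mathbf{j}, \mathbf{k}\}$ shown in Figure 16.4. This frame rotates about the axis $\{O, \mathbf{k}\}$ with 
angular velocity $\boldsymbol{\Omega}$ given by 

$$\boldsymbol{\Omega} = \Omega\,\mathbf{k} = \frac{V}{R}\,\mathbf{k}. \tag{16.20}$$

Viewed from the rotating frame, the wheel is rotating about a fixed axis parallel to 
the vector $\mathbf{i}$. On applying the rolling condition, the angular velocity $\boldsymbol{\omega}'$ of the wheel, 
viewed from the rotating frame, is given by 

$$\boldsymbol{\omega}' = -\frac{V}{b}\,\mathbf{i}. \tag{16.21}$$

FIGURE 16.4 A wheel of radius $b$ rolls around a circle of radius 
$R$. The unit vectors $\mathbf{i}(t)$ and $\mathbf{j}(t)$ follow the wheel as it moves 
around the circle; $\mathbf{k}$ is a constant unit vector. 

The true angular velocity $\boldsymbol{\omega}$ of the wheel is then given by the sum 

$$\boldsymbol{\omega} = \boldsymbol{\omega}' + \boldsymbol{\Omega}. \tag{16.22}$$

Here we are using a result from Chapter 17, the angular velocity addition theorem; 
it is the rotational counterpart of the addition theorem for linear velocities that we 
obtained in Chapter 2. Although we have yet to prove this result, it is clear enough 
what it means and we will use it anyway! On substituting (16.20) and (16.21) into 
(16.22) we find the angular velocity of the wheel to be 

$$\boldsymbol{\omega} = -\frac{V}{b}\,\mathbf{i} + \frac{V}{R}\,\mathbf{k}. \tag{16.23}$$

Now for the particle accelerations. These can be found by differentiating the 
velocity formula 

$$\mathbf{v} = V\mathbf{j} + \boldsymbol{\omega}\times(\mathbf{r} - \mathbf{c}), \tag{16.24}$$

with respect to $t$, where we have taken C, the centre of the wheel, as the reference 
particle. This gives 

$$\begin{aligned}
\mathbf{a} &= V\frac{d\mathbf{j}}{dt} + \dot{\boldsymbol{\omega}}\times(\mathbf{r} - \mathbf{c}) + \boldsymbol{\omega}\times(\dot{\mathbf{r}} - \dot{\mathbf{c}}) \\
&= V\frac{d\mathbf{j}}{dt} + \dot{\boldsymbol{\omega}}\times(\mathbf{r} - \mathbf{c}) + \boldsymbol{\omega}\times(\mathbf{v} - V\mathbf{j}),
\end{aligned}$$

since $\dot{\mathbf{r}} = \mathbf{v}$ and $\dot{\mathbf{c}} = V\mathbf{j}$. In particular, since $\mathbf{r}^Q = \mathbf{c} - b\,\mathbf{k}$ and $\mathbf{v}^Q = \mathbf{0}$, the 
acceleration of the contact particle Q is given by 

$$\begin{aligned}
\mathbf{a}^Q &= V\frac{d\mathbf{j}}{dt} + \dot{\boldsymbol{\omega}}\times(-b\,\mathbf{k}) + \boldsymbol{\omega}\times(-V\mathbf{j}) \\
&= V\frac{d\mathbf{j}}{dt} + V\frac{d\mathbf{i}}{dt}\times\mathbf{k} + \frac{V^2}{b}\,\mathbf{k} + \frac{V^2}{R}\,\mathbf{i},
\end{aligned} \tag{16.25}$$

on using the formula (16.23) to replace $\boldsymbol{\omega}$ and $\dot{\boldsymbol{\omega}}$. 

The only unknown quantities left are $d\mathbf{i}/dt$ and $d\mathbf{j}/dt$. However, the vectors $\mathbf{i}$, 
$\mathbf{j}$ correspond precisely to the polar unit vectors $\hat{\mathbf{r}}$, $\hat{\boldsymbol{\theta}}$ treated in section 2.3, from which 
we deduce that 

$$\frac{d\mathbf{i}}{dt} = \dot{\theta}\,\mathbf{j} = \frac{V}{R}\,\mathbf{j} \quad\text{and}\quad \frac{d\mathbf{j}}{dt} = -\dot{\theta}\,\mathbf{i} = -\frac{V}{R}\,\mathbf{i}.$$

On substituting these formulae into equation (16.25) we obtain 

$$\mathbf{a}^Q = \frac{V^2}{R}\,\mathbf{i} + \frac{V^2}{b}\,\mathbf{k}$$

as the acceleration of the contact particle Q. ■ 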
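
As an independent check, the acceleration of the contact particle can be estimated by differentiating its position numerically. The sketch below is written under stated assumptions: the wheel centre is taken at $R\,\mathbf{i}(t) + b\,\mathbf{k}$ with $\mathbf{i}(t) = (\cos\theta, \sin\theta, 0)$, $\mathbf{j}(t) = (-\sin\theta, \cos\theta, 0)$ and $\theta = Vt/R$, and the material particle that touches the floor at $t = 0$ is taken at the offset $-b\cos\psi\,\mathbf{k} - b\sin\psi\,\mathbf{j}(t)$ from the centre, with $\psi = Vt/b$. The numerical values and function names are hypothetical.

```python
import math

V, b, R = 2.0, 0.5, 3.0    # hypothetical speed, wheel radius, path radius

def position(t):
    """Position of the rim particle that is in contact with the floor at t = 0."""
    theta = V * t / R                      # angle of the wheel centre around the path
    psi = V * t / b                        # angle turned by the wheel about its axle
    i = (math.cos(theta), math.sin(theta), 0.0)
    j = (-math.sin(theta), math.cos(theta), 0.0)
    # centre at R i + b k; offset of the particle is -b cos(psi) k - b sin(psi) j
    return (R * i[0] - b * math.sin(psi) * j[0],
            R * i[1] - b * math.sin(psi) * j[1],
            b - b * math.cos(psi))

def first_derivative(f, t, h=1e-5):
    p, m = f(t + h), f(t - h)
    return tuple((p[n] - m[n]) / (2 * h) for n in range(3))

def second_derivative(f, t, h=1e-4):
    p, c, m = f(t + h), f(t), f(t - h)
    return tuple((p[n] - 2 * c[n] + m[n]) / h ** 2 for n in range(3))

v_Q_numeric = first_derivative(position, 0.0)    # should vanish (rolling condition)
a_Q_numeric = second_derivative(position, 0.0)

# At t = 0 the frame vectors are i = (1, 0, 0) and k = (0, 0, 1),
# so the result derived in the text has these components:
a_Q_formula = (V ** 2 / R, 0.0, V ** 2 / b)
```

The finite-difference estimate agrees with the derived components to within the discretisation error, and the vanishing first derivative confirms that the assumed parametrisation satisfies the rolling condition.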


Problems on Chapter 16 

Answers and comments are at the end of the book. 
Harder problems carry a star (*). 


16.1 A rigid body is rotating in the right-handed sense about the axis Oz with a constant 
angular speed of 2 radians per second. Write down the angular velocity vector of the body, 
and find the instantaneous velocity, speed and acceleration of the particle of the body at the 
point (4, —3, 7), where distances are measured in metres. 

16.2 A rigid body is rotating with constant angular speed 3 radians per second about a fixed 
axis through the points $A(4, 1, 1)$, $B(2, -1, 0)$, distances being measured in centimetres. The 
rotation is in the left-handed sense relative to the direction $\overrightarrow{AB}$. Find the instantaneous velocity 
and acceleration of the particle P of the body at the point $(4, 4, 4)$. 

16.3 A spinning top (a rigid body of revolution) is in general motion with its vertex (a particle 
on the axis of symmetry) fixed at the origin O. Let $\mathbf{a}(t)$ be the unit vector pointing along the 
axis of symmetry and let $\boldsymbol{\omega}(t)$ be the angular velocity of the top. (In general, $\boldsymbol{\omega}$ does not point 
along the axis of symmetry.) By considering the velocities of particles of the top that lie on 
the axis of symmetry, show that $\mathbf{a}$ satisfies the equation 

$$\dot{\mathbf{a}} = \boldsymbol{\omega}\times\mathbf{a}.$$

Deduce that the most general form $\boldsymbol{\omega}$ can have is 

$$\boldsymbol{\omega} = \mathbf{a}\times\dot{\mathbf{a}} + \lambda\,\mathbf{a},$$

where $\lambda$ is a scalar function of the time. [This formula is needed in the theory of the spinning 
top.] 

16.4 A penny of radius $a$ rolls without slipping on a rough horizontal table. The penny rolls 
in such a way that its centre G remains fixed (see Figure 16.5). The plane of the penny makes a 
constant angle $\alpha$ with the table and the point of contact C traces out a circle with centre O and 
radius $a\cos\alpha$, as shown. At time $t$, the angle between the radius OC and some fixed radius is 
$\theta$. Find the angular velocity vector of the penny in terms of the unit vectors $\mathbf{a}(t)$, $\mathbf{k}$ shown. 
Find the velocity of the highest particle of the penny. 

FIGURE 16.5 A penny of radius b rolls on a 
rough horizontal table in such a way that its 
centre G remains fixed. 

16.5 A rigid circular cone with altitude $h$ and semi-angle $\alpha$ rolls without slipping on a rough 
horizontal table. Explain why the vertex O of the cone never moves. Let $\theta(t)$ be the angle 
between OC, the line of the cone that is in contact with the table, and some fixed horizontal 
reference line OA. Show that the angular velocity $\boldsymbol{\omega}$ of the cone is given by 

$$\boldsymbol{\omega} = -\left(\dot{\theta}\cot\alpha\right)\mathbf{i},$$

where $\mathbf{i}(t)$ is the unit vector pointing in the direction $\overrightarrow{OC}$. [First identify the direction of $\boldsymbol{\omega}$, 
and then consider the velocities of those particles of the cone that lie on the axis of symmetry.] 
Identify the particle of the cone that has the maximum speed and find this speed. 

16.6* Two rigid plastic panels lie in the planes $z = -b$ and $z = b$ respectively. A rigid ball 
of radius $b$ can move in the space between the panels and is gripped by them so that it does not 
slip. The panels are made to rotate with angular velocities $\omega_1\mathbf{k}$, $\omega_2\mathbf{k}$ about fixed vertical axes 
that are a distance $2c$ apart. Show that, with a suitable choice of origin, the position vector $\mathbf{R}$ 
of the centre of the ball satisfies the equation 

$$\dot{\mathbf{R}} = \boldsymbol{\Omega}\times\mathbf{R},$$

where $\boldsymbol{\Omega} = \tfrac{1}{2}(\omega_1 + \omega_2)\,\mathbf{k}$. Deduce that the ball must move in a circle and find the position of 
the centre of this circle. [This arrangement is sometimes seen as a shop window display. The 
panels are transparent and the ball seems to be executing a circle in mid-air.] 

16.7 Two hollow spheres have radii a and b (b > a), and their common centre O is fixed. A 
rigid ball of radius $\tfrac{1}{2}(b - a)$ can move in the annular space between the spheres and is gripped
by them so that it does not slip. The spheres are made to rotate with constant angular velocities
$\boldsymbol{\omega}_1$, $\boldsymbol{\omega}_2$ respectively. Show that the ball must move in a circle whose plane is perpendicular to
the vector $a\,\boldsymbol{\omega}_1 + b\,\boldsymbol{\omega}_2$.

Chapter Seventeen

Rotating reference frames


KEY FEATURES 

The key features of this chapter are the transformation of velocity and acceleration between 
frames in general relative motion, and the dynamical effects of the Earth’s rotation. 


So far we have viewed the motion of mechanical systems from an inertial reference 
frame. The reason for this is simple; the Second Law, in its standard form, applies only in 
inertial frames. However, circumstances arise in which it is convenient to view the motion 
from a non-inertial frame. The most important instance of this occurs when the motion 
takes place near the surface of the Earth. Previously we have argued that the dynamical 
effects of the Earth’s rotation are small enough to be neglected. While this is usually true, 
there are circumstances in which it has a significant effect. In long range artillery, the 
Earth’s rotation gives rise to an important correction, and, in the hydrodynamics of the 
atmosphere and oceans, the Earth’s rotation can have a dominant effect. If we wish to calculate
such effects (as seen by an observer on the Earth), we must take our reference frame
fixed to the Earth, thus making it a non-inertial frame. The downside of this choice is that 
the Second Law does not hold and must be replaced by a considerably more complicated 
equation. 

In addition to applications involving the Earth’s rotation, there are instances where 
the motion of a system looks much simpler when viewed from a suitably chosen rotating 
frame. The Larmor precession of a charged particle moving in a uniform magnetic field 
is one example; a second is the motion of a rigid body relative to its own principal axes of 
inertia, which leads to Euler’s equations. 


17.1 TRANSFORMATION FORMULAE 

In this section we derive the transformation formulae that link the velocity and 
acceleration of a particle measured in a moving frame, with the same quantities measured 
in a fixed frame. For the purposes of kinematics, the labels ‘fixed’ and ‘moving’ are
arbitrary. Each frame is moving relative to the other and the labels could be reversed; we 
use them purely for convenience. In dynamics however the distinction between the fixed 
and moving frames is real. The ‘fixed’ frame is an inertial frame, in which Newton’s laws
apply, and the moving frame is a non-inertial frame, in which Newton’s laws do not apply,
at least in their standard form.

FIGURE 17.1 The moving frame $T' = \{O';\, \mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ has translational velocity
$\mathbf{V}$ and angular velocity $\boldsymbol{\Omega}$ relative to the fixed frame $T = \{O;\, \mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$.

Let $T = \{O;\, \mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ be the fixed frame and $T' = \{O';\, \mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ be the moving
frame, as shown in Figure 17.1. At time t, the frame T' has translational velocity* $\mathbf{V}$ and
angular velocity $\boldsymbol{\Omega}$ relative to the frame T. It is convenient to regard the moving reference
frame T' as being embedded in a rigid body B' with reference particle O'. Then $\mathbf{V}$ and
$\boldsymbol{\Omega}$ are the translational and angular velocities of the body B'. The most important feature
of reference frames in relative motion is this: 

The rate of change of a vector quantity measured in the frame T is generally not the 
same as the rate of change of the same quantity measured in the frame T'. 

Question Why are there different rates of change? 

Suppose u(t) is a vector quantity. Why should it have different rates of change when 
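measured in the frames T and T'?

Answer

Suppose the expression for $\mathbf{u}$ in terms of the basis set $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ is

$$\mathbf{u} = u_1\mathbf{e}_1 + u_2\mathbf{e}_2 + u_3\mathbf{e}_3, \tag{17.1}$$

and in terms of the basis set $\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ is

$$\mathbf{u} = u'_1\mathbf{e}'_1 + u'_2\mathbf{e}'_2 + u'_3\mathbf{e}'_3. \tag{17.2}$$

In general, the components $\{u_1, u_2, u_3\}$ and $\{u'_1, u'_2, u'_3\}$ will be functions of the time
t.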


* This means that the origin O' has velocity $\mathbf{V}$ relative to T. The angular velocity $\boldsymbol{\Omega}$ is independent of the
choice of O'.

In the rate of change of $\mathbf{u}$ measured in T, the basis set $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ is, by definition,
constant so that

$$\left(\frac{d\mathbf{u}}{dt}\right)_{T} = \frac{du_1}{dt}\,\mathbf{e}_1 + \frac{du_2}{dt}\,\mathbf{e}_2 + \frac{du_3}{dt}\,\mathbf{e}_3. \tag{17.3}$$

In contrast, in the rate of change of $\mathbf{u}$ measured in T', the basis set $\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ is, by
definition, constant so that

$$\left(\frac{d\mathbf{u}}{dt}\right)_{T'} = \frac{du'_1}{dt}\,\mathbf{e}'_1 + \frac{du'_2}{dt}\,\mathbf{e}'_2 + \frac{du'_3}{dt}\,\mathbf{e}'_3. \tag{17.4}$$

Note that the components $\{u_k\}$ and $\{u'_k\}$ are scalar functions of the time and so their
rates of change are independent of the reference frame. This is why we do not need to
label them as being observed in T or T'. There is no reason why the two expressions
(17.3) and (17.4) should be equal and, in general, they are not equal. Consider, for
example, the case in which $\mathbf{u}$ is constant in T' so that $(d\mathbf{u}/dt)_{T'} = \mathbf{0}$. However, the
motion of T' relative to T means that $\mathbf{u}$ will not be constant in T and $(d\mathbf{u}/dt)_{T}$ will
not be zero. ■

True and apparent values 

In order to simplify the writing, we will, from now on, refer to the value of a quantity 
measured in the fixed frame T as its true value, and the value measured in the moving 
frame T' as its apparent value. For example, $(d\mathbf{u}/dt)_{T}$ will be referred to as the true value
of $d\mathbf{u}/dt$, while $(d\mathbf{u}/dt)_{T'}$ will be referred to as its apparent value.

Rates of change of the basis vectors $\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$

Our first step is to find the rates of change of the fundamental basis vectors $\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$
belonging to the frame T'. Since these vectors are, by definition, constant in T', their
apparent rates of change are zero. What we need to find are their true rates of change.

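Let E be any particle of the body B' in which the frame T' is embedded, and let $\boldsymbol{\epsilon}$
and $\boldsymbol{\epsilon}'$ be the position vectors of E relative to O and O' respectively. Then, by the triangle
law,

$$\boldsymbol{\epsilon} = \mathbf{D} + \boldsymbol{\epsilon}', \tag{17.5}$$

where $\mathbf{D}$ is the position vector of O' relative to O. Then

$$\left(\frac{d\boldsymbol{\epsilon}}{dt}\right)_{T} = \left(\frac{d\mathbf{D}}{dt}\right)_{T} + \left(\frac{d\boldsymbol{\epsilon}'}{dt}\right)_{T} = \mathbf{V} + \left(\frac{d\boldsymbol{\epsilon}'}{dt}\right)_{T}, \tag{17.6}$$

where $\mathbf{V}$ is the true velocity of O' relative to O. But $(d\boldsymbol{\epsilon}/dt)_{T}$ is, by definition, $\mathbf{v}^E$, the
true velocity of the particle E, and, since E is a particle of the rigid body B', $\mathbf{v}^E$ is given
by

$$\mathbf{v}^E = \mathbf{V} + \boldsymbol{\Omega} \times \boldsymbol{\epsilon}' \tag{17.7}$$

on using the kinematical formula (16.17). On comparing the equations (17.6) and (17.7),
we see that

$$\left(\frac{d\boldsymbol{\epsilon}'}{dt}\right)_{T} = \boldsymbol{\Omega} \times \boldsymbol{\epsilon}'.$$

This result applies to any vector $\boldsymbol{\epsilon}'$ that is the position vector (relative to O') of a particle
of the rigid body B'. In particular, since the basis vectors $\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ can be regarded as
the position vectors of particles of B', we obtain the fundamental relations

$$\left(\frac{d\mathbf{e}'_j}{dt}\right)_{T} = \boldsymbol{\Omega} \times \mathbf{e}'_j \qquad (1 \le j \le 3).$$

The notation $(d\mathbf{e}'_j/dt)_{T}$ for the true rate of change of the vector $\mathbf{e}'_j$ is accurate but cumbersome.
Since the apparent rate of change of these vectors is zero, there is little chance
of confusion if, from now on, we replace $(d\mathbf{e}'_j/dt)_{T}$ by the simple notation $\dot{\mathbf{e}}'_j$. Our result
can then be expressed in the form:

True rates of change of the basis vectors $\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$

$$\dot{\mathbf{e}}'_1 = \boldsymbol{\Omega} \times \mathbf{e}'_1, \qquad \dot{\mathbf{e}}'_2 = \boldsymbol{\Omega} \times \mathbf{e}'_2, \qquad \dot{\mathbf{e}}'_3 = \boldsymbol{\Omega} \times \mathbf{e}'_3 \tag{17.8}$$

where $\boldsymbol{\Omega}$ is the angular velocity of the frame T' relative to the frame T.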


Relation between the true and apparent values of du/dt 

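Our next step is to find the relationship between the true and apparent values of $d\mathbf{u}/dt$,
where $\mathbf{u}$ is any vector function of the time. To do this we differentiate the representation
(17.2) with respect to t while keeping the basis set $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ constant. This gives

$$\begin{aligned}
\left(\frac{d\mathbf{u}}{dt}\right)_{T}
&= \left(\frac{d(u'_1\mathbf{e}'_1)}{dt}\right)_{T} + \left(\frac{d(u'_2\mathbf{e}'_2)}{dt}\right)_{T} + \left(\frac{d(u'_3\mathbf{e}'_3)}{dt}\right)_{T} \\
&= \left(\dot{u}'_1\mathbf{e}'_1 + \dot{u}'_2\mathbf{e}'_2 + \dot{u}'_3\mathbf{e}'_3\right) + \left(u'_1\dot{\mathbf{e}}'_1 + u'_2\dot{\mathbf{e}}'_2 + u'_3\dot{\mathbf{e}}'_3\right) \\
&= \left(\frac{d\mathbf{u}}{dt}\right)_{T'} + u'_1(\boldsymbol{\Omega}\times\mathbf{e}'_1) + u'_2(\boldsymbol{\Omega}\times\mathbf{e}'_2) + u'_3(\boldsymbol{\Omega}\times\mathbf{e}'_3) \\
&= \left(\frac{d\mathbf{u}}{dt}\right)_{T'} + \boldsymbol{\Omega}\times\left(u'_1\mathbf{e}'_1 + u'_2\mathbf{e}'_2 + u'_3\mathbf{e}'_3\right) \\
&= \left(\frac{d\mathbf{u}}{dt}\right)_{T'} + \boldsymbol{\Omega}\times\mathbf{u}.
\end{aligned}$$

The true and apparent values of $d\mathbf{u}/dt$ are therefore related by the formula:

Transformation formula for $d\mathbf{u}/dt$

$$\left(\frac{d\mathbf{u}}{dt}\right)_{T} = \left(\frac{d\mathbf{u}}{dt}\right)_{T'} + \boldsymbol{\Omega}\times\mathbf{u} \tag{17.9}$$

where $\boldsymbol{\Omega}$ is the angular velocity of the frame T' relative to the frame T.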
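Formula (17.9) can be checked numerically in a simple special case. The sketch below assumes that the two frames share an origin, that T' rotates about the third axis with a constant angular speed, and that u is constant in T', so that its apparent rate of change is zero. The helper names (`cross`, `rotate_z`, `u_true`, `true_rate`) and all numerical values are illustrative choices, not taken from the text.

```python
import math

def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def rotate_z(vec, angle):
    """Rotate a 3-vector about the third axis through the given angle."""
    c, s = math.cos(angle), math.sin(angle)
    return (c * vec[0] - s * vec[1], s * vec[0] + c * vec[1], vec[2])

OMEGA = 0.7                      # assumed angular speed of T' (rad/s)
omega_vec = (0.0, 0.0, OMEGA)    # angular velocity of T' relative to T
u_prime = (1.0, 2.0, 3.0)        # components of u in T' (constant there)

def u_true(t):
    """Components of u resolved in the fixed frame T at time t."""
    return rotate_z(u_prime, OMEGA * t)

def true_rate(t, h=1e-6):
    """Central-difference estimate of the true rate of change (du/dt)_T."""
    up, um = u_true(t + h), u_true(t - h)
    return tuple((p - m) / (2 * h) for p, m in zip(up, um))

t0 = 1.3
lhs = true_rate(t0)                   # (du/dt)_T, estimated numerically
rhs = cross(omega_vec, u_true(t0))    # (du/dt)_T' + Omega x u, with (du/dt)_T' = 0
max_error = max(abs(l - r) for l, r in zip(lhs, rhs))
print(max_error)                      # small: limited only by the finite-difference step
```

The two sides agree to within the accuracy of the finite-difference estimate, which is what (17.9) predicts when the apparent rate of change vanishes.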

Velocity transformation formula 

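It is now easy to find how particle velocities transform between the two frames. Suppose
a particle P has position vector $\mathbf{r}$ relative to T and position vector $\mathbf{r}'$ relative to T'. It
follows from the triangle law that

$$\mathbf{r} = \mathbf{D} + \mathbf{r}', \tag{17.10}$$

where $\mathbf{D}$ is the position vector of O' relative to O. Then

$$\left(\frac{d\mathbf{r}}{dt}\right)_{T} = \left(\frac{d\mathbf{D}}{dt}\right)_{T} + \left(\frac{d\mathbf{r}'}{dt}\right)_{T},$$

that is,

$$\mathbf{v} = \mathbf{V} + \left(\frac{d\mathbf{r}'}{dt}\right)_{T},$$

where $\mathbf{v}$ is the true velocity of P. From the transformation formula (17.9), it follows that

$$\left(\frac{d\mathbf{r}'}{dt}\right)_{T} = \left(\frac{d\mathbf{r}'}{dt}\right)_{T'} + \boldsymbol{\Omega}\times\mathbf{r}' = \mathbf{v}' + \boldsymbol{\Omega}\times\mathbf{r}', \tag{17.11}$$

where $\mathbf{v}'$ is the apparent velocity of P. On combining these results, we obtain:

Velocity transformation formula

$$\mathbf{v} = \mathbf{V} + \boldsymbol{\Omega}\times\mathbf{r}' + \mathbf{v}' \tag{17.12}$$

This is the velocity transformation formula connecting the true and apparent velocities
of the particle P.

This formula has a simple interpretation. The true velocity $\mathbf{v}$ is the sum of (i) the
apparent velocity $\mathbf{v}'$, and (ii) the velocity $\mathbf{V} + \boldsymbol{\Omega}\times\mathbf{r}'$ that is given to the particle by the
motion of the frame T'.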
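As a small illustration of the velocity transformation formula (17.12), the sketch below evaluates v = V + Ω × r' + v' for two made-up cases. It assumes that all vectors are resolved along the same set of axes at the instant considered; the function name `true_velocity` and the sample numbers are hypothetical.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(*vectors):
    """Component-wise sum of any number of 3-vectors."""
    return tuple(sum(components) for components in zip(*vectors))

def true_velocity(V, omega, r_prime, v_prime):
    """Velocity transformation (17.12): v = V + Omega x r' + v'."""
    return add(V, cross(omega, r_prime), v_prime)

omega = (0.0, 0.0, 3.0)      # frame rotates about the third axis at 3 rad/s
r_prime = (2.0, 0.0, 0.0)    # particle is 2 m from the axis

# Particle at rest in the moving frame, frame origin fixed.
v_rest = true_velocity((0.0, 0.0, 0.0), omega, r_prime, (0.0, 0.0, 0.0))

# Particle moving inwards at 0.5 m/s while the frame origin translates at 1 m/s.
v_walk = true_velocity((1.0, 0.0, 0.0), omega, r_prime, (-0.5, 0.0, 0.0))

print(v_rest)   # velocity given to the particle by the rotation alone
print(v_walk)
```

In the first case the only contribution is the term Ω × r', which is the velocity the particle acquires simply by being carried round with the frame.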

An immediate consequence of the velocity transformation formula is the corresponding
transformation rule for angular velocities.





FIGURE 17.2 The dark shaded cone is fixed and the light 
shaded cone rolls around it. The vertex of each cone is at 
rest at O.

Theorem 17.1 Addition theorem for angular velocities Suppose a rigid body in
general motion has true angular velocity $\boldsymbol{\omega}$ and apparent angular velocity $\boldsymbol{\omega}'$. Then $\boldsymbol{\omega}$
and $\boldsymbol{\omega}'$ are related by the addition formula

$$\boldsymbol{\omega} = \boldsymbol{\Omega} + \boldsymbol{\omega}', \tag{17.13}$$

where $\boldsymbol{\Omega}$ is the angular velocity of the frame T' relative to T. [This result is the counterpart
of the velocity transformation formula $\mathbf{v} = \mathbf{V} + \mathbf{v}'$ for translating frames.]

The proof of this theorem is the subject of Problem 17.2. We have already used this 
result in Chapter 16, Example 16.4, and we give a second example now. 

Example 17.1 Cone rolling on another cone 

A circular cone with semi-angle 45° is fixed with its axis of symmetry vertical and 
its vertex O upwards. An identical cone also has its vertex fixed at O and rolls on 
the first cone so that its axis of symmetry precesses around the upward vertical with 
angular speed $\lambda$ as shown. Find the angular speed of the rolling cone.

Solution

Let T be a fixed frame and T' be a frame with origin at O and precessing with the
rolling cone. Then T' has angular velocity $\lambda\mathbf{k}$ relative to T, where the unit vector $\mathbf{k}$
points vertically upwards. In the frame T' the second cone has its axis of symmetry
fixed and the first cone has angular velocity $-\lambda\mathbf{k}$. The rolling contact between the
two cones means that the second cone must be rotating about its axis of symmetry
$\{O, \mathbf{n}\}$ with angular speed $\lambda$ in the negative sense. Its angular velocity is therefore
$-\lambda\mathbf{n}$. In the notation of Theorem 17.1 we therefore have $\boldsymbol{\Omega} = \lambda\mathbf{k}$ and $\boldsymbol{\omega}' = -\lambda\mathbf{n}$. It
follows that the angular velocity of the rolling cone is

$$\boldsymbol{\omega} = \boldsymbol{\Omega} + \boldsymbol{\omega}' = \lambda\mathbf{k} - \lambda\mathbf{n}.$$

Since each cone has semi-angle 45°, the axis $\{O, \mathbf{n}\}$ is perpendicular to $\mathbf{k}$, and
the angular speed of the rolling cone is therefore $\sqrt{2}\,\lambda$. ■
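The result of Example 17.1 can be checked with a few lines of arithmetic. The sketch below assumes a particular instant at which the axis of the rolling cone points along the first coordinate axis (it is horizontal because the two semi-angles of 45° add up to a right angle), and takes an arbitrary value for the precession rate. It confirms the angular speed and also checks that the angular velocity is parallel to the line of contact, as it must be for rolling. All names and numbers are illustrative.

```python
import math

def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

LAMBDA = 2.0                 # assumed precession rate (rad/s)
k = (0.0, 0.0, 1.0)          # upward vertical
n = (1.0, 0.0, 0.0)          # axis of the rolling cone at this instant (horizontal)

# Addition theorem (17.13): omega = Omega + omega' = lambda k - lambda n.
omega = tuple(LAMBDA * kc - LAMBDA * nc for kc, nc in zip(k, n))
speed = math.sqrt(sum(c * c for c in omega))

# The line of contact lies midway between the downward vertical and n.
s = 1.0 / math.sqrt(2.0)
contact = (s, 0.0, -s)
residual = cross(omega, contact)   # zero if omega is parallel to the contact line

print(speed, math.sqrt(2.0) * LAMBDA)
print(residual)
```

The particles on the line of contact are instantaneously at rest, so the angular velocity of the rolling cone has to lie along that line; the vanishing cross product reflects this.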
Acceleration transformation formula

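From the velocity transformation formula (17.12), it follows that

$$\left(\frac{d\mathbf{v}}{dt}\right)_{T} = \left(\frac{d\mathbf{V}}{dt}\right)_{T} + \left(\frac{d\boldsymbol{\Omega}}{dt}\right)_{T}\times\mathbf{r}' + \boldsymbol{\Omega}\times\left(\frac{d\mathbf{r}'}{dt}\right)_{T} + \left(\frac{d\mathbf{v}'}{dt}\right)_{T},$$

that is,

$$\mathbf{a} = \mathbf{A} + \dot{\boldsymbol{\Omega}}\times\mathbf{r}' + \boldsymbol{\Omega}\times\left(\frac{d\mathbf{r}'}{dt}\right)_{T} + \left(\frac{d\mathbf{v}'}{dt}\right)_{T}, \tag{17.14}$$

where $\mathbf{a}$ is the true acceleration of the particle P, and $\mathbf{A}$, $\boldsymbol{\Omega}$ and $\dot{\boldsymbol{\Omega}}$ are the translational
acceleration, the angular velocity, and the angular acceleration of the frame T', all relative
to the frame T.

The derivative $(d\mathbf{r}'/dt)_{T}$ has already been evaluated in (17.11), and the derivative
$(d\mathbf{v}'/dt)_{T}$ is given by the transformation formula (17.9) to be

$$\left(\frac{d\mathbf{v}'}{dt}\right)_{T} = \left(\frac{d\mathbf{v}'}{dt}\right)_{T'} + \boldsymbol{\Omega}\times\mathbf{v}' = \mathbf{a}' + \boldsymbol{\Omega}\times\mathbf{v}', \tag{17.15}$$

where $\mathbf{a}'$ is the apparent acceleration of P. On combining these results, we obtain

$$\mathbf{a} = \mathbf{A} + \dot{\boldsymbol{\Omega}}\times\mathbf{r}' + \boldsymbol{\Omega}\times\left(\mathbf{v}' + \boldsymbol{\Omega}\times\mathbf{r}'\right) + \left(\mathbf{a}' + \boldsymbol{\Omega}\times\mathbf{v}'\right)$$

which, on simplification, gives the acceleration transformation formula:

Acceleration transformation formula

$$\mathbf{a} = \mathbf{A} + \dot{\boldsymbol{\Omega}}\times\mathbf{r}' + 2\boldsymbol{\Omega}\times\mathbf{v}' + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}') + \mathbf{a}' \tag{17.16}$$

where $\mathbf{A}$ is the translational acceleration, $\boldsymbol{\Omega}$ is the angular velocity and $\dot{\boldsymbol{\Omega}}$ is the angular
acceleration of the frame T', all relative to the frame T.

The acceleration formula does not have a simple interpretation. The apparent acceleration
is $\mathbf{a}'$ and the acceleration given to the particle by the motion of the frame T' is
$\mathbf{A} + \dot{\boldsymbol{\Omega}}\times\mathbf{r}' + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}')$. However, the true acceleration is not the sum of these two
contributions! There is the additional term $2\boldsymbol{\Omega}\times\mathbf{v}'$ that appears only when the particle is
moving within the moving frame. This Coriolis* term, as it is called, is hard to explain
physically.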


* After Gaspard-Gustave de Coriolis (1792-1843) whose paper on the subject was published in 1835. He
also introduced the notion of ‘work’ with its present scientific meaning.

FIGURE 17.3 Man walking on a rotating roundabout. The figure labels the Coriolis force $-2m\,\boldsymbol{\Omega}\times\mathbf{v}'$ and the centrifugal force $-m\,\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}')$.
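The separate terms of the acceleration transformation formula (17.16) are easy to evaluate numerically. The following sketch assumes all vectors are resolved along a common set of axes at the instant considered; the function `acceleration_terms`, its dictionary keys and the sample values are hypothetical names chosen for illustration.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(*vectors):
    """Component-wise sum of any number of 3-vectors."""
    return tuple(sum(components) for components in zip(*vectors))

def acceleration_terms(A, omega, omega_dot, r_prime, v_prime, a_prime):
    """Evaluate each term of (17.16) and the resulting true acceleration."""
    angular_acc_term = cross(omega_dot, r_prime)              # Omega-dot x r'
    coriolis = tuple(2.0 * c for c in cross(omega, v_prime))  # 2 Omega x v'
    centripetal = cross(omega, cross(omega, r_prime))         # Omega x (Omega x r')
    total = add(A, angular_acc_term, coriolis, centripetal, a_prime)
    return {"angular_acc_term": angular_acc_term,
            "coriolis": coriolis,
            "centripetal": centripetal,
            "total": total}

zero = (0.0, 0.0, 0.0)
omega = (0.0, 0.0, 2.0)      # steady rotation about the third axis
r_prime = (3.0, 0.0, 0.0)

# Particle at rest in the rotating frame: only the centripetal term survives.
at_rest = acceleration_terms(zero, omega, zero, r_prime, zero, zero)

# Particle moving radially inwards at 1 m/s: the Coriolis term appears.
walking = acceleration_terms(zero, omega, zero, r_prime, (-1.0, 0.0, 0.0), zero)

print(at_rest["total"])
print(walking["coriolis"], walking["total"])
```

For the particle at rest in the rotating frame the result is the familiar centripetal acceleration of magnitude Ω² times the distance from the axis, directed towards the axis. The Coriolis term is non-zero only in the second case, where the particle moves within the rotating frame.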

17.2 PARTICLE DYNAMICS IN A NON-INERTIAL FRAME 

Suppose now that T is an inertial frame. Then the equation of motion of a
particle P of mass m moving under the force $\mathbf{F}$ is simply given by the Second Law

$$m\mathbf{a} = \mathbf{F}, \tag{17.17}$$

where $\mathbf{a}$ is the true acceleration of P. It follows from the acceleration transformation
formula (17.16) that, when the same motion is observed from the frame T', equation
(17.17) is replaced by the transformed equation of motion

Second Law in a general non-inertial frame

$$m\left[\mathbf{A} + \dot{\boldsymbol{\Omega}}\times\mathbf{r}' + 2\boldsymbol{\Omega}\times\mathbf{v}' + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}') + \mathbf{a}'\right] = \mathbf{F}. \tag{17.18}$$

Here $\mathbf{r}'$, $\mathbf{v}'$, $\mathbf{a}'$ are the apparent position, velocity and acceleration of P, and $\mathbf{A}$, $\boldsymbol{\Omega}$ and $\dot{\boldsymbol{\Omega}}$
are the translational acceleration, angular velocity and angular acceleration of the frame
T' relative to the frame T.

This is the form taken by the Second Law in a general non-inertial frame. Note that
there is no new physics in this equation. It is simply the Second Law written in terms of
the apparent quantities observed in the non-inertial frame T'.

Example 17.2 Man walking on a rotating roundabout 

A fairground roundabout is made to rotate with constant angular speed $\Omega$ about its
axis of symmetry which is fixed in a vertical position, the sense of the rotation being
as shown in Figure 17.3. A man is on the roundabout and is walking grimly towards
the centre O' along the moving radius A'O' with constant speed V. What is the force
that the roundabout exerts on the man?

Solution

Let the frame $T = \{O;\, \mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ be an inertial frame of reference attached to the
ground and the frame $T' = \{O';\, \mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ be attached to the roundabout, as shown
in Figure 17.3. Then T' has zero translational velocity and constant angular velocity
$\boldsymbol{\Omega}$ $(= \Omega\,\mathbf{e}_3)$ relative to T. Hence $\mathbf{A} = \mathbf{0}$ and $\dot{\boldsymbol{\Omega}} = \mathbf{0}$ in this problem.

In the frame T', the motion of the man is uniform and rectilinear so that $\mathbf{r}' = x'_1\mathbf{e}'_1$,
$\mathbf{v}' = -V\mathbf{e}'_1$ and $\mathbf{a}' = \mathbf{0}$. The transformed equation of motion (17.18) therefore
reduces to

$$m\left[\,2(\Omega\mathbf{e}_3)\times(-V\mathbf{e}'_1) + (\Omega\mathbf{e}_3)\times\left((\Omega\mathbf{e}_3)\times(x'_1\mathbf{e}'_1)\right)\right] = -mg\,\mathbf{e}_3 + \mathbf{X},$$

where $\mathbf{X}$ is the force that the roundabout exerts on the man. Hence, $\mathbf{X}$ is given by

$$\mathbf{X} = (mg)\,\mathbf{e}_3 - (2m\Omega V)\,\mathbf{e}'_2 - (m\Omega^2 x'_1)\,\mathbf{e}'_1.$$

Because of the additional terms* in this expression, the man must lean forwards and
to his left; otherwise he will fall over. ■
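To get a feel for the sizes of the terms in Example 17.2, the sketch below evaluates the force X from the transformed equation of motion (17.18) for some made-up values (a 70 kg man, 3 m from the centre, walking at 1 m/s on a roundabout turning at 0.5 rad/s). These numbers and the variable names are assumptions for illustration only; components are taken along the axes attached to the roundabout, with the third axis vertical.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

m = 70.0        # mass of the man (kg), assumed
g = 9.81        # gravitational acceleration (m/s^2)
Omega = 0.5     # angular speed of the roundabout (rad/s), assumed
V = 1.0         # walking speed towards the centre (m/s), assumed
x1 = 3.0        # current distance from the centre (m), assumed

omega = (0.0, 0.0, Omega)
r_prime = (x1, 0.0, 0.0)
v_prime = (-V, 0.0, 0.0)

coriolis_acc = tuple(2.0 * c for c in cross(omega, v_prime))   # 2 Omega x v'
centripetal_acc = cross(omega, cross(omega, r_prime))          # Omega x (Omega x r')

# (17.18) with A = 0, Omega-dot = 0, a' = 0:  m[...] = -m g e3 + X.
X = tuple(m * (c + p) + w
          for c, p, w in zip(coriolis_acc, centripetal_acc, (0.0, 0.0, m * g)))

# Closed form from the example, for comparison.
X_formula = (-m * Omega**2 * x1, -2.0 * m * Omega * V, m * g)

print(X)
print(X_formula)
```

With these values the sideways component of the force is larger than the inward component, which is consistent with the remark that the man must lean to one side as well as forwards.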

Fictitious forces 

Equation (17.18) can be made to resemble the standard form of the Second Law by the
simple device of transferring the four new terms on the left to the right side of the equation
and regarding them as fictitious forces that act on P in addition to the real force $\mathbf{F}$. This
gives

$$m\mathbf{a}' = \mathbf{F} + (-m\mathbf{A}) + (-m\dot{\boldsymbol{\Omega}}\times\mathbf{r}') + \underbrace{(-2m\boldsymbol{\Omega}\times\mathbf{v}')}_{\text{Coriolis force}} + \underbrace{(-m\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}'))}_{\text{centrifugal force}}, \tag{17.19}$$

which we will call the fictitious force equation.

In other words, the fact that the frame T' is non-inertial may be ignored provided that
the four fictitious forces $-m\mathbf{A}$, $-m\dot{\boldsymbol{\Omega}}\times\mathbf{r}'$, $-2m\boldsymbol{\Omega}\times\mathbf{v}'$ and $-m\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}')$ are added to
the real force $\mathbf{F}$. Two of these ‘forces’ have names. The ‘force’ $-2m\boldsymbol{\Omega}\times\mathbf{v}'$ is called the
Coriolis force and the ‘force’ $-m\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}')$ is called the centrifugal force.† In the
problem of the man on the rotating roundabout, only the Coriolis and centrifugal ‘forces’
are non-zero and their directions are shown in Figure 17.3.

The transformed equation of motion and the fictitious force equation are obviously
equivalent and there is no logical need to introduce the notion of fictitious forces at all.
However, the fictitious force approach is often useful when looking at a problem qualitatively.
For example, in the problem of the man walking on the roundabout, the fictitious
force description would be as follows:


* If the roundabout were at rest, then $\mathbf{X}$ would be simply $mg\,\mathbf{e}_3$.

† Although the name ‘centrifugal force’ for the expression $-m\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}')$ is standard in the literature, it
is not really appropriate unless O' = O, that is, the two frames of reference have a common origin.

the orbital motion of the Earth altogether. We will regard the Earth as an axially symmetric
rigid body* whose axis of symmetry is at rest in an inertial frame I. Previously we have
denoted the moving frame by T', but it would be insufferable to attach a dash to every
quantity from now until the end of the chapter. We will therefore drop the dashes and,
from now on, we will let $T = \{O;\, \mathbf{i}, \mathbf{j}, \mathbf{k}\}$ be a non-inertial frame attached to the surface
of the Earth, as shown in Figure 17.4. The orientations of the basis vectors $\{\mathbf{i}, \mathbf{j}, \mathbf{k}\}$ will
be described later. Because of the Earth’s rotation, the frame T has an acceleration $\mathbf{A}$ and
an angular velocity $\boldsymbol{\Omega}$ relative to the inertial frame I. The acceleration $\mathbf{A}$ is caused by the
circular motion of the origin O around the Earth’s rotation axis and acts towards the axis.
The angular velocity $\boldsymbol{\Omega}$ is the same as the angular velocity of the Earth itself; it is therefore
parallel to the rotation axis and constant. If $\mathbf{n}$ is the unit vector in the direction SN, where
S and N are the south and north poles, then $\boldsymbol{\Omega} = \Omega\,\mathbf{n}$, where $\Omega$ is $2\pi$ radians per day,†
or about 0.000073 radians per second.
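The quoted value of the Earth's angular speed is a one-line calculation. The sketch below evaluates it both for a 24 hour day and for the rotation period of 23 h 56 m mentioned in the footnote; the variable names are illustrative.

```python
import math

SOLAR_DAY = 24 * 3600.0               # seconds in a 24 hour day
ROTATION_PERIOD = 23 * 3600.0 + 56 * 60.0   # 23 h 56 m, in seconds

omega_solar = 2.0 * math.pi / SOLAR_DAY
omega_rotation = 2.0 * math.pi / ROTATION_PERIOD

print(f"{omega_solar:.3e} rad/s")     # using a 24 hour day
print(f"{omega_rotation:.3e} rad/s")  # using 23 h 56 m
```

Both values agree with the figure of about 0.000073 radians per second to two significant figures, so the distinction between the two periods does not matter at the accuracy used in this chapter.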

Let P be a particle of mass m and position vector $\mathbf{r}$ relative to O. Then the transformed
equation of motion for P is

$$m\left[\mathbf{A} + 2\boldsymbol{\Omega}\times\mathbf{v} + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}) + \mathbf{a}\right] = \mathbf{F}_E. \tag{17.20}$$

Here $\mathbf{r}$, $\mathbf{v}$ and $\mathbf{a}$ are the apparent position, velocity and acceleration of P, and $\mathbf{F}_E$ is the
gravitational force exerted on P by the Earth. There may be other forces acting on P, but,
for the moment, we suppose that the only force is the Earth’s gravity.

Suppose first that P is released from rest (in T) at the point with position vector $\mathbf{r}$.
Then its initial acceleration $\mathbf{a}_0$ is given by equation (17.20) to be

$$\mathbf{a}_0 = m^{-1}\mathbf{F}_E - \mathbf{A} - \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}). \tag{17.21}$$

If the Earth were not rotating, this acceleration would be the acceleration due to the Earth’s
gravity at the point $\mathbf{r}$. With rotation present, it is still the observed acceleration of a
body released from rest, but it is not caused by gravity alone. We call this the apparent
gravitational acceleration (at the point $\mathbf{r}$). The direction opposite to $\mathbf{a}_0$ is the apparent
vertical direction. Let $\mathbf{k}$ $(= \mathbf{k}(\mathbf{r}))$ be the unit vector in the apparent vertical direction.
Then $\mathbf{a}_0$ can be written in the form

$$\mathbf{a}_0 = -g(\mathbf{r})\,\mathbf{k}(\mathbf{r}), \tag{17.22}$$

where g is the magnitude of the apparent gravitational acceleration. These values of g
and $\mathbf{k}$ differ slightly (by less than 1%) from the corresponding quantities due to Earth’s
gravity alone. In terms of the quantities g and $\mathbf{k}$, the transformed equation of motion for
P simplifies to give

$$m\left[\frac{d\mathbf{v}}{dt} + 2\boldsymbol{\Omega}\times\mathbf{v}\right] = -mg(\mathbf{r})\,\mathbf{k}(\mathbf{r}).$$

* We are not restricted to regard the Earth as spherically symmetric.

† Actually, the Earth revolves not once every day, but every 23 h 56 m. Why?

FIGURE 17.5 Left: The Foucault pendulum is an ordinary pendulum suspended
from a point O fixed to the Earth. Right: The precessing frame OXYZ rotates
around the axis Oz with angular velocity $-(\Omega\cos\beta)\,\mathbf{k}$.

the ball will not travel down the apparent vertical, but will be displaced in the positive
$\mathbf{j}$-direction, that is, to the east. It remains to determine this easterly displacement in
terms of h, the height of the tower. For this we need the total travel time $\tau$, but it
is consistent with the first order approximation to determine $\tau$ from the zero order
theory which gives $\tau = (2h/g)^{1/2}$. Hence, because of the Earth’s rotation, the ball
does not travel down the apparent vertical exactly but drifts to the east a distance

$$\frac{1}{3}\left(\frac{8h^3}{g}\right)^{1/2}\Omega\sin\beta.$$

For a ball dropped from the top of the Tower of Pisa, which is about 50 m high and
is at latitude 44°N, this easterly drift is about 5.6 mm. ■
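The figure quoted for the Tower of Pisa can be reproduced directly. The sketch below assumes the standard first order result for the easterly drift, $\tfrac{1}{3}(8h^3/g)^{1/2}\,\Omega\sin\beta$, and takes $\beta$ to be the angle between the apparent vertical and the rotation axis, that is, 90° minus the latitude. The function name `easterly_drift`, the value of g and the rotation period of 86164 s are assumptions made for the illustration.

```python
import math

EARTH_OMEGA = 2.0 * math.pi / 86164.0   # rad/s, assuming a rotation period of 86164 s
G = 9.81                                # m/s^2, assumed apparent gravitational acceleration

def easterly_drift(height_m, latitude_deg):
    """First order easterly drift of a body dropped from rest at the given height."""
    beta = math.radians(90.0 - latitude_deg)   # angle between vertical and rotation axis
    return (1.0 / 3.0) * math.sqrt(8.0 * height_m**3 / G) * EARTH_OMEGA * math.sin(beta)

drift = easterly_drift(50.0, 44.0)
fall_time = math.sqrt(2.0 * 50.0 / G)   # zero order travel time (2h/g)^(1/2)

print(f"fall time  = {fall_time:.2f} s")
print(f"east drift = {1000.0 * drift:.1f} mm")
```

The drift grows like the 3/2 power of the height, so doubling the height of the tower increases it by a factor of about 2.8.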


Example 17.4 The Foucault pendulum 


How does the Earth’s rotation affect the small oscillations of an ordinary pendulum? 

Solution 

Let the origin O be the suspension point of the pendulum as shown in Figure 17.5.
The transformed equation of motion for the pendulum bob P is

$$m\left[\frac{d\mathbf{v}}{dt} + 2\boldsymbol{\Omega}\times\mathbf{v}\right] = -mg\,\mathbf{k} + \mathbf{T},$$

where $\mathbf{T}$ is the tension force exerted by the string. The magnitude T of the tension
force is unknown but its direction is opposite to the position vector $\mathbf{r}$ so that

$$\mathbf{T} = -\frac{T}{a}\,\mathbf{r},$$

where a is the length of the string. The equation of motion for P can therefore be
written

$$m\left[\frac{d\mathbf{v}}{dt} + 2\boldsymbol{\Omega}\times\mathbf{v}\right] = -mg\,\mathbf{k} - \frac{T}{a}\,\mathbf{r}. \tag{17.29}$$


This non-linear equation, which holds for large oscillations of the pendulum, cannot
be solved in closed form. We will therefore restrict attention to the case of small
oscillations which allows us to replace (17.29) by a linear approximation. The zero
order approximation corresponds to the ‘motion’ $\mathbf{r} = -a\mathbf{k}$, in which the pendulum
hangs in equilibrium. Thus the zero order approximation to the tension is T = mg.

Furthermore, if we write $\mathbf{r} = x\mathbf{i} + y\mathbf{j} + z\mathbf{k}$, then, in any motion of the pendulum,
$x^2 + y^2 + z^2 = a^2$, from which it follows that

$$z = -\left(a^2 - (x^2 + y^2)\right)^{1/2} = -a\left(1 - \frac{x^2 + y^2}{2a^2} + \cdots\right),$$

which shows that the change in z is second order in the small quantities x and y. It can
similarly be shown that $v_z$ and $dv_z/dt$ are also second order quantities. On using these
approximations in equation (17.29), the linearised equation for the horizontal motion
of P is found to be

$$\frac{d\mathbf{v}^H}{dt} + 2\Omega\cos\beta\,(\mathbf{k}\times\mathbf{v}^H) + n^2\,\mathbf{r}^H = \mathbf{0}, \tag{17.30}$$

where $\mathbf{r}^H = x\mathbf{i} + y\mathbf{j}$, $\mathbf{v}^H = v_x\mathbf{i} + v_y\mathbf{j}$ and $n^2 = g/a$. Note that n is the angular frequency
the pendulum would have if the Earth were not rotating. Equation
(17.30) is the linearised equation for the motion of the point P' shown in Figure 17.5.

The two-dimensional vector equation (17.30) is equivalent to a pair of homogeneous
second order linear equations with constant coefficients and can therefore be
solved by standard methods. However, this is a messy job and there is a much simpler
way of generating the solutions. The term $2\Omega\cos\beta\,(\mathbf{k}\times\mathbf{v}^H)$ can be eliminated by
viewing the motion from a frame* OXYZ rotating with angular velocity $-(\Omega\cos\beta)\,\mathbf{k}$
relative to the frame T (see Figure 17.5). We will call this new frame the precessing
frame and the horizontal displacement, velocity and acceleration of P in this frame
will be denoted by $\mathbf{R}^H$, $\mathbf{V}^H$ and $\mathbf{A}^H$. The standard transformation formulae then give

$$\mathbf{v}^H = \mathbf{V}^H + (-\Omega\cos\beta\,\mathbf{k})\times\mathbf{R}^H,$$

$$\mathbf{a}^H = \mathbf{A}^H + 2(-\Omega\cos\beta\,\mathbf{k})\times\mathbf{V}^H + (-\Omega\cos\beta\,\mathbf{k})\times\left[(-\Omega\cos\beta\,\mathbf{k})\times\mathbf{R}^H\right] \tag{17.31}$$

and if these formulae are substituted into equation (17.30), we find that the linearised
equation of motion for P in the precessing frame can be written in the form

$$\frac{d^2\mathbf{R}^H}{dt^2} + \left(n^2 + \Omega^2\cos^2\beta\right)\mathbf{R}^H = \mathbf{0}, \tag{17.32}$$

which is the two-dimensional SHM equation. Its solutions are known to be ellipses
with centre O and angular frequency $(n^2 + \Omega^2\cos^2\beta)^{1/2}$. The difference between
this frequency and n is completely negligible.

* Note that this operation does not simply return us to the underlying inertial frame.

Conclusion In small oscillations, the period of the pendulum is unchanged by the
Earth's rotation, but its motion now precesses* around the apparent vertical with
angular velocity $-(\Omega\cos\beta)\,\mathbf{k}$. In particular, an oscillation that would take place
in a fixed vertical plane through O when $\Omega = 0$ will now take place in a vertical
plane through O that precesses slowly around the axis Oz with angular velocity
$-(\Omega\cos\beta)\,\mathbf{k}$. This precession is in the negative sense in the northern hemisphere,
and in the positive sense in the southern hemisphere. ■

This practical demonstration of the Earth’s rotation is called Foucault’s pendulum†
and is a favourite exhibit in science museums the world over. [But not in Singapore.
Why?] From the duration of your museum visit, and the angle turned by the plane of
motion of the pendulum while you were inside, you can work out your latitude!
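The museum calculation suggested above can be sketched in a few lines. The code assumes that the plane of oscillation turns at the rate $\Omega\cos\beta$, where $\beta$ is the angle between the apparent vertical and the rotation axis, so that $\cos\beta$ equals the sine of the latitude. The function names, the rotation period of 86164 s and the sample latitude are illustrative assumptions.

```python
import math

ROTATION_PERIOD = 86164.0                     # seconds, assumed rotation period of the Earth
EARTH_OMEGA = 2.0 * math.pi / ROTATION_PERIOD

def precession_rate(latitude_deg):
    """Magnitude of the precession rate, Omega cos(beta) = Omega sin(latitude), in rad/s."""
    return EARTH_OMEGA * math.sin(math.radians(latitude_deg))

def precession_period_hours(latitude_deg):
    """Time for the plane of oscillation to turn through a full circle."""
    return 2.0 * math.pi / precession_rate(latitude_deg) / 3600.0

def latitude_from_rotation(angle_turned_rad, duration_s):
    """Recover the latitude from the angle turned during a visit of given duration."""
    ratio = angle_turned_rad / (EARTH_OMEGA * duration_s)
    ratio = max(-1.0, min(1.0, ratio))        # guard against rounding just outside [-1, 1]
    return math.degrees(math.asin(ratio))

# A two hour visit at an assumed latitude of 48.85 degrees north.
visit = 2.0 * 3600.0
angle = precession_rate(48.85) * visit
recovered = latitude_from_rotation(angle, visit)

print(f"angle turned      = {math.degrees(angle):.1f} degrees")
print(f"recovered latitude = {recovered:.2f} degrees")
print(f"full turn takes    = {precession_period_hours(48.85):.1f} hours")
```

Because the rate is proportional to the sine of the latitude, the time for a full turn grows without limit as the latitude approaches zero.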


Winds, ocean currents and bathwater 

Atmospheric winds and ocean currents are often strongly influenced by the Coriolis force 
induced by the Earth’s rotation. We may analyse these fluid motions by analogy with 
particle dynamics. 

By analogy with equation (17.23) for the motion of a particle over the Earth, the equation of motion
for the steady flow of the atmosphere or the ocean is

$$\rho\,\mathbf{a} + 2\rho\,\boldsymbol{\Omega}\times\mathbf{v} = -\rho g\,\mathbf{k} - \operatorname{grad} p. \tag{17.33}$$

Here, $\mathbf{v}(\mathbf{r})$ and $\mathbf{a}(\mathbf{r})$ are the velocity and acceleration of the fluid particle with position vector $\mathbf{r}$, $\rho$ is the
fluid density and p is the pressure field in the fluid. The term $-\operatorname{grad} p$ is called the pressure gradient and
represents the force exerted on a fluid particle by the fluid around it. The acceleration $\mathbf{a}$ can be written in
terms of $\mathbf{v}$. If we follow the progress in time of a fluid particle, it follows from the chain rule that

$$\mathbf{a} = \frac{\partial\mathbf{v}}{\partial x}\frac{dx}{dt} + \frac{\partial\mathbf{v}}{\partial y}\frac{dy}{dt} + \frac{\partial\mathbf{v}}{\partial z}\frac{dz}{dt} = v_x\frac{\partial\mathbf{v}}{\partial x} + v_y\frac{\partial\mathbf{v}}{\partial y} + v_z\frac{\partial\mathbf{v}}{\partial z}. \tag{17.34}$$

Let us now compare the sizes of the two acceleration terms on the left of equation (17.33). Suppose that
the flow has characteristic length scale L and velocity scale V. Then the Coriolis acceleration $2\boldsymbol{\Omega}\times\mathbf{v}$ is of
order $\Omega V$ and, from equation (17.34), the acceleration $\mathbf{a}$ is of order $V^2/L$. The dimensionless ratio of these
two magnitudes, given by

$$Ro = \frac{V}{\Omega L}, \tag{17.35}$$

is called the Rossby number‡ of the flow. When the Rossby number is small, the Coriolis acceleration
dominates over the real acceleration. When this condition is satisfied, the equation of motion (17.33) can
be approximated by

$$2\rho\,\boldsymbol{\Omega}\times\mathbf{v} = -\rho g\,\mathbf{k} - \operatorname{grad} p. \tag{17.36}$$
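The Rossby number (17.35) is simple to evaluate for particular flows. The sketch below compares a large-scale weather system with water draining from a bath; the length and velocity scales used are rough illustrative guesses, not values taken from the text, and the function name `rossby_number` is hypothetical.

```python
import math

EARTH_OMEGA = 2.0 * math.pi / 86164.0   # rad/s, assuming a rotation period of 86164 s

def rossby_number(velocity_scale, length_scale, omega=EARTH_OMEGA):
    """Rossby number Ro = V / (Omega L) of a flow with the given scales."""
    return velocity_scale / (omega * length_scale)

# Assumed scales: a weather system with V = 10 m/s over L = 1000 km,
# and bathwater with V = 0.1 m/s over L = 1 m.
ro_weather = rossby_number(10.0, 1.0e6)
ro_bath = rossby_number(0.1, 1.0)

print(f"weather system: Ro = {ro_weather:.2f}")
print(f"bathwater:      Ro = {ro_bath:.0f}")
```

On these assumed scales the weather system has a Rossby number well below one, so the Coriolis term is important, while the bathwater has a Rossby number in the thousands, so the Earth's rotation has a negligible influence on it.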


* A precession is a slow rotation superimposed on a fast motion.

† After Jean Bernard Léon Foucault (1819-1868), who first exhibited it at the 1851 Paris Exhibition. This
was the first time the Earth’s rotation had been demonstrated by an entirely terrestrial method.

‡ After the Swedish-American meteorologist Carl-Gustav Rossby (1898-1957).

As in the standard linear momentum principle, the double sum of the $\{\mathbf{G}_{ij}\}$ is zero. In terms of centre of
mass variables, this equation can be written

$$M\left[\mathbf{A} + \dot{\boldsymbol{\Omega}}\times\mathbf{R} + 2\boldsymbol{\Omega}\times\mathbf{V} + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{R}) + \frac{d\mathbf{V}}{dt}\right] = \mathbf{F}, \tag{17.38}$$

where $\mathbf{R}$ and $\mathbf{V}$ are the position and velocity of the centre of mass, M is the total mass, and $\mathbf{F}$ is the total
external force. This is the centre of mass form of the linear momentum principle in a non-inertial frame.

We have therefore proved that:


Linear momentum principle in a non-inertial frame 

In any reference frame, the centre of mass of a system moves as if it were a particle 
of mass the total mass, and all the external forces acted upon it. 


Hence, even in a non-inertial frame, the motion of the centre of mass of a system can 
still be calculated by particle mechanics. 


Energy principle 

There is also a form of energy conservation that applies in frames with a fixed origin and 
constant angular velocity. The proof is as follows: 

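In this case the acceleration $\mathbf{A}$ and the angular acceleration $\dot{\boldsymbol{\Omega}}$ are zero and the transformed equation
of motion for the particle $P_i$ becomes

$$m_i\left[2\boldsymbol{\Omega}\times\mathbf{v}_i + \boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}_i) + \frac{d\mathbf{v}_i}{dt}\right] = \mathbf{F}_i + \sum_{j=1}^{N}\mathbf{G}_{ij},$$

in the standard notation. If we now take the scalar product of this equation with $\mathbf{v}_i$, the Coriolis term vanishes
and we obtain

$$m_i\left[\mathbf{v}_i\cdot\left[\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}_i)\right] + \mathbf{v}_i\cdot\frac{d\mathbf{v}_i}{dt}\right] = \mathbf{F}_i\cdot\mathbf{v}_i + \sum_{j=1}^{N}\mathbf{G}_{ij}\cdot\mathbf{v}_i \tag{17.39}$$

$(1 \le i \le N)$. This is the same equation as for an inertial frame except for the centrifugal term. This term
can be written

$$\begin{aligned}
\mathbf{v}_i\cdot\left[\boldsymbol{\Omega}\times(\boldsymbol{\Omega}\times\mathbf{r}_i)\right]
&= \mathbf{v}_i\cdot\left[(\boldsymbol{\Omega}\cdot\mathbf{r}_i)\,\boldsymbol{\Omega} - \Omega^2\,\mathbf{r}_i\right] \\
&= (\boldsymbol{\Omega}\cdot\mathbf{v}_i)(\boldsymbol{\Omega}\cdot\mathbf{r}_i) - \Omega^2\,(\mathbf{v}_i\cdot\mathbf{r}_i) \\
&= \frac{d}{dt}\left[\tfrac{1}{2}(\boldsymbol{\Omega}\cdot\mathbf{r}_i)^2 - \tfrac{1}{2}\Omega^2\,\mathbf{r}_i\cdot\mathbf{r}_i\right] \\
&= -\frac{d}{dt}\left[\tfrac{1}{2}\Omega^2 p_i^2\right],
\end{aligned}$$

where $p_i$ is the perpendicular distance of $P_i$ from the rotation axis. On substituting this formula into
equation (17.39) and summing over all the particles we obtain

$$\frac{d}{dt}\left[\sum_{i=1}^{N}\tfrac{1}{2}m_i\,\mathbf{v}_i\cdot\mathbf{v}_i - \tfrac{1}{2}\Omega^2\sum_{i=1}^{N}m_i p_i^2\right] = \sum_{i=1}^{N}\mathbf{F}_i\cdot\mathbf{v}_i + \sum_{i=1}^{N}\sum_{j=1}^{N}\mathbf{G}_{ij}\cdot\mathbf{v}_i,$$

that is,

$$\frac{d}{dt}\left(T - \tfrac{1}{2}I_A\Omega^2\right) = \sum_{i=1}^{N}\mathbf{F}_i\cdot\mathbf{v}_i + \sum_{i=1}^{N}\sum_{j=1}^{N}\mathbf{G}_{ij}\cdot\mathbf{v}_i, \tag{17.40}$$

where T is the apparent total kinetic energy and $I_A$ is the moment of inertia of the system about the
rotation axis A. The right side of equation (17.40) is the apparent total rate of working of all the forces. If
the apparent total work done can be represented by the potential energy V, we will say that the system is
apparently conservative. In this case, we have conservation of energy in the form

$$T + V - \tfrac{1}{2}I_A\Omega^2 = E,$$

where E is a constant. The term $-\tfrac{1}{2}I_A\Omega^2$ is sometimes called the centrifugal potential energy and we
will call this equation the transformed energy equation.

FIGURE 17.6 The straight rod rotates with constant angular speed $\Omega$ around O and the
square lamina slides along the rod.

Our result is summarised as follows: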



Energy conservation in a uniformly rotating frame 

If a system is apparently conservative when viewed from a rotating reference frame
with a fixed origin and constant angular velocity, energy conservation holds in the
modified form

$$T + V - \tfrac{1}{2}I_A\Omega^2 = E, \tag{17.41}$$

where T and V are the apparent kinetic and potential energies, $\Omega$ is the angular
speed of the reference frame, and $I_A$ is the moment of inertia of the system about
the rotation axis.


For systems with one degree of freedom, this energy conservation principle is sufficient
to determine the motion, as in the following example.

Example 17.5 

A long straight smooth rod lies on a smooth horizontal table. One of its ends is fixed 
at point O on the table and the rod is made to rotate around O with constant angular 
speed £2. A uniform square lamina of side 2 a can slide freely on the table. One of 
its sides is in contact with the rod at all times and the lamina slides freely along the 
rod. Initially, the lamina is at rest (relative to the rod) with one corner at O. Find the 
subsequent displacement of the lamina as a function of the time. 
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Solution 

We will view the motion of the lamina from a reference frame with origin O in which
the rod is at rest. This frame has a fixed origin and constant angular velocity.

First we find the apparent kinetic energy. Let r be the distance OC shown in
Figure 17.6. In the rotating frame, the motion of the lamina is rectilinear and its
apparent kinetic energy is therefore given by $T = \tfrac{1}{2} M \dot{r}^2$, where M is the mass of the lamina.

Next we find the apparent potential energy. There are no specified external forces, 
but there is the constraint force exerted on the lamina by the rod, which, since the rod 
is smooth, acts perpendicular to the rod. In an inertial frame, this constraint force 
does work, but, in the frame rotating with the rod, the lamina simply moves along 
the stationary rod. The apparent rate of working of the constraint force is therefore
zero. The internal forces within the lamina enforce rigidity and do no apparent
work in total. The system is therefore apparently conservative with potential
energy V = 0.

The transformed energy equation is therefore 

$$\tfrac{1}{2} M \dot{r}^2 - \tfrac{1}{2} I_A \Omega^2 = E,$$

where $I_A$ is the moment of inertia of the lamina about the vertical axis through O, and
$\Omega$ is the angular speed of the rod. By the ‘parallel axes’ theorem (see the Appendix
at the end of the book), $I_A$ is given by

$$I_A = I_G + M\left(a^2 + r^2\right),$$

so that the equation becomes

$$\tfrac{1}{2} M \dot{r}^2 - \tfrac{1}{2}\left(I_G + M\left(a^2 + r^2\right)\right)\Omega^2 = E.$$

The initial conditions $r = a$ and $\dot{r} = 0$ when $t = 0$ give

$$E = -\tfrac{1}{2}\left(I_G + 2Ma^2\right)\Omega^2,$$

so that the transformed energy equation for the lamina is finally given by

$$\dot{r}^2 = \Omega^2\left(r^2 - a^2\right).$$

This equation is sufficient to determine the motion. On taking square roots and integrating,
we find that the displacement of the lamina at time t is

$$r = a \cosh \Omega t,$$

which is the same answer as for a particle sliding along a rotating wire. ■
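
The result of Example 17.5 can be checked numerically. The following is a minimal sketch, not part of the original solution: it assumes the solution r = a cosh Ωt and the reduced energy equation (dr/dt)² = Ω²(r² − a²), and the function names and the values of a and Ω are chosen purely for illustration.

```python
import math

def lamina_position(a, omega, t):
    """r(t) = a cosh(omega t), the displacement found in Example 17.5."""
    return a * math.cosh(omega * t)

def lamina_speed(a, omega, t):
    """dr/dt = a omega sinh(omega t), obtained by differentiating r(t)."""
    return a * omega * math.sinh(omega * t)

def energy_residual(a, omega, t):
    """Left side minus right side of (dr/dt)^2 = omega^2 (r^2 - a^2)."""
    r = lamina_position(a, omega, t)
    r_dot = lamina_speed(a, omega, t)
    return r_dot**2 - omega**2 * (r**2 - a**2)

a, omega = 0.5, 2.0   # illustrative values only
for t in (0.0, 0.3, 1.0):
    print(t, lamina_position(a, omega, t), energy_residual(a, omega, t))
```

The residual should vanish to rounding error at every time, and the initial conditions r = a, dr/dt = 0 are recovered at t = 0.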
Problems on Chapter 17

Answers and comments are at the end of the book. 
Harder problems carry a star (*). 


Kinematics in rotating frames 

17.1 Use the velocity and acceleration transformation formulae to derive the standard expressions
for the velocity and acceleration of a particle in plane polar coordinates.

17.2 Addition of angular velocities Prove the ‘addition of angular velocities’ theorem, Theorem
17.13. [Hint. For a general particle of the rigid body, find v and v' in terms of $\boldsymbol{\omega}$ and $\boldsymbol{\omega}'$
respectively. Then relate v and v' by the velocity transformation formula.]

17.3 A circular cone with semi-angle $\alpha$ is fixed with its axis of symmetry vertical and its
vertex O upwards. A second circular cone has semi-vertical angle $(\pi/2) - \alpha$ and has its vertex
fixed at O. The second cone rolls on the first cone so that its axis of symmetry precesses
around the upward vertical with angular speed $\lambda$. Find the angular speed of the rolling cone.

Dynamics in rotating frames 

17.4 A particle P of mass m can slide along a smooth rigid straight wire. The wire has one
of its points fixed at the origin O, and is made to rotate in a plane through O with constant
angular speed $\Omega$. Show that r, the distance of P from O, satisfies the equation

$$\ddot{r} - \Omega^2 r = 0.$$

Initially, P is at rest (relative to the wire) at a distance a from O. Find r as a function of t in
the subsequent motion.

17.5 Larmor precession A particle of mass m and charge e moves in the force field F(r)
and the uniform magnetic field Bk, where k is a constant unit vector. Its equation of motion is
then

$$m \frac{d\mathbf{v}}{dt} = \left(\frac{eB}{c}\right) \mathbf{v} \times \mathbf{k} + \mathbf{F}(\mathbf{r})$$

in cgs Gaussian units. Show that the term $(eB/c)\,\mathbf{v} \times \mathbf{k}$ can be removed from the equation by
viewing the motion from an appropriate rotating frame.

For the special case in which $\mathbf{F}(\mathbf{r}) = -m\omega_0^2\, \mathbf{r}$, show that circular motions with two
different frequencies are possible.

17.6 A bullet is fired vertically upwards with speed u from a point on the Earth with
co-latitude $\beta$. Show that it returns to the ground west of the firing point by a distance
$4\Omega u^3 \sin\beta / 3g^2$.

17.7 An artillery shell is fired from a point on the Earth with co-latitude $\beta$. The direction of
firing is due south, the muzzle speed of the shell is u and the angle of elevation of the barrel is
$\alpha$. Show that the effect of the Earth’s rotation is to deflect the shell to the east by a distance

$$\frac{4\Omega u^3}{3g^2} \sin^2\alpha \left(3\cos\alpha\cos\beta + \sin\alpha\sin\beta\right).$$

17.8 An artillery shell is fired from a point on the Earth with co-latitude $\beta$. The direction of
firing is due east, the muzzle speed of the shell is u and the angle of elevation of the barrel is
$\alpha$. Show that the effect of the Earth’s rotation is to deflect the shell to the south by a distance

$$\frac{4\Omega u^3}{g^2} \sin^2\alpha \cos\alpha \cos\beta.$$

* Show also that the easterly range is increased by

$$\frac{4\Omega u^3}{3g^2} \sin\alpha \left(3 - 4\sin^2\alpha\right) \sin\beta.$$

[Hint. The second part requires a corrected value for the flight time.]

Energy conservation in rotating frames 

17.9 Consider Problem 17.4 again. This time find the motion of the particle by using the 
transformed energy equation. 

17.10 One end of a straight rod is fixed at a point O on a smooth horizontal table and the rod
is made to rotate around O with constant angular speed $\Omega$. A uniform circular disk of radius
a lies flat on the table and can slide freely upon it. The disk remains in contact with the rod
at all times and is constrained to roll along the rod. Initially, the disk is at rest (relative to the
rod) with its point of contact at a distance a from O. Find the displacement of the disk as a
function of the time.

17.11 A horizontal turntable is made to rotate about a fixed vertical axis with constant angular
speed $\Omega$. A hollow uniform circular cylinder of mass M and radius a can roll on the turntable.
Initially the cylinder is at rest (relative to the turntable), with its centre of mass on the rotation
axis, when it is slightly disturbed. Find the speed of the cylinder when it has rolled a distance
x on the turntable.

* Find also an expression (in terms of x) for the force that the turntable exerts on the
cylinder.

Hydrostatics in rotating frames 

When a fluid is at rest in an inertial frame, the equation of hydrostatics is

$$\mathbf{0} = \mathbf{F} - \operatorname{grad} p,$$

where p is the pressure field in the fluid, and F is the body force (force per unit volume) acting on the
fluid. This equation means that the body force is balanced by the pressure gradient. In the case of uniform
gravity, $\mathbf{F} = -\rho g \mathbf{k}$ and, if $\rho$ is constant, the equation integrates to give the well known ‘hydrostatic pressure
formula’ $p = p_0 - \rho g z$, where Oz points vertically upwards.

If a fluid is at rest in a rotating frame with a fixed origin and constant angular velocity $\boldsymbol{\Omega}$, then the
equation of ‘hydrostatics’ in this frame is

$$\rho\, \boldsymbol{\Omega} \times (\boldsymbol{\Omega} \times \mathbf{r}) = \mathbf{F} - \operatorname{grad} p,$$

where $\rho$ is the fluid density. (This equation means that the body force, the pressure gradient and the
centrifugal 'force' must balance.) This is all you need to know to answer the following questions.

FIGURE 17.7 A mass of liquid in equilibrium (left) and rotating like a rigid body (centre and
right, cut in half to show the cross sections). The right figure shows the mass rotating with the
critical angular speed.


17.12 Newton’s bucket A bucket half full of water is made to rotate with angular speed $\Omega$
about its axis of symmetry, which is vertical. Find, to within a constant, the pressure field in
the fluid. By considering the isobars (surfaces of constant pressure) of this pressure field, find
the shape of the free surface of the water.

What would the shape of the free surface be if the bucket were replaced by a cubical box?

17.13 A sealed circular can of radius a is three-quarters full of water of density $\rho$, the remainder
being air at pressure $p_0$. The can is taken into gravity free space and then rotated about its
axis of symmetry with constant angular speed $\Omega$. Where will the water be when it comes to
rest relative to the can? Find the water pressure at the wall of the can.

Computer assisted problem 

The following problem is suitable for a supervised project. 

17.14** If a mass of liquid in gravity free space is in equilibrium, then it has the form of 
a sphere, stabilised by its own surface tension (see Figure 17.7 (left)). Suppose that the mass
now rotates like a rigid body with constant angular velocity. What will its equilibrium shape 
be now? Show that, as the angular speed increases, the mass becomes more oblate until its 
top and bottom surfaces become quite flat (Figure 17.7 (centre)). Show further that, at higher 
angular speeds, the top and bottom surfaces approach each other until they meet (Figure 17.7 
(right)) and the mass breaks up. Find this critical angular speed for a sphere of mercury of 
radius 5 cm. 
Chapter Eighteen


Tensor algebra 
and the inertia tensor 


KEY FEATURES 

The key features of this chapter are the transformation formulae for the components of tensors
and tensor algebra; the inertia tensor and the calculation of the angular momentum and
kinetic energy of a rigid body; and the principal axes and principal moments of inertia of a
rigid body.


We have previously regarded a vector as a quantity that has magnitude and direction.
Picturing a vector as a line segment with an arrow on it has helped us to understand essentially
difficult concepts, such as the acceleration of a particle and the angular momentum
of a rigid body. Neither of these quantities has any direct connection with line segments,
but the picture is a valuable aid nonetheless. Useful though this notion of vectors is, it is
not capable of generalisation and this is the main reason why we will now look at vectors
from a different perspective. In reality, what we actually observe are the three components
of a vector, and the values of these components will depend on the coordinate system in
which they are measured. However, a general triple of real numbers $\{v_1, v_2, v_3\}$, defined
in each coordinate system, does not necessarily constitute a vector. The reason is that the
components of a vector in different coordinate systems are related to each other (in a way
that we shall determine) and, unless the quantities $\{v_1, v_2, v_3\}$ satisfy this transformation
formula, they are not the components of a vector. This leads us to a new definition of a
vector as a quantity that has three components in each coordinate system, whose values in
different coordinate systems are related by the vector transformation formula.

Defined in the above way, vectors can be regarded as part of a hierarchy of entities
called tensors with a tensor of order n having $3^n$ components. A tensor of order zero,
which has one component, is a scalar, and a tensor of order one, which has three components,
is a vector. Tensors of higher order, which are entirely new objects, are defined
to be quantities with $3^n$ components that obey the appropriate transformation formula.
In this chapter we give a gentle introduction to the principles of tensor algebra, which
should be useful, not only for its present application to dynamics, but also in other areas
such as relativity. Any account of tensor algebra is mathematics, not mechanics, and some
readers may find this indigestible. However, there are very few basic principles involved
and, once these are mastered, the subject seems (and actually is) rather easy.
FIGURE 18.1 The Cartesian coordinate systems $C = Ox_1x_2x_3$ and
$C' = Ox'_1x'_2x'_3$ have the associated unit vectors $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ and
$\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ respectively. The two systems actually have a common origin
but are drawn separately for clarity.


We then introduce the inertia tensor, a second order tensor that is used to calculate the 
angular momentum L and kinetic energy T of a generally rotating rigid body. It transpires 
that, by taking a special choice of axes, known as principal axes, the formulae for L and 
T are greatly simplified and reduce to non-tensorial forms. This makes it possible to solve 
most problems in rigid body dynamics without any knowledge of tensors at all.* The few 
results that one really needs in rigid body dynamics are given at the end of Section 18.6. 


18.1 ORTHOGONAL TRANSFORMATIONS 

Let $C = Ox_1x_2x_3$ and $C' = Ox'_1x'_2x'_3$ be two Cartesian coordinate systems having
the common origin O and associated unit vectors $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ and $\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ respectively
(see Figure 18.1). Then any vector v (understood in the elementary sense of Chapter 1)
can be expanded in the form

$$\mathbf{v} = v_1 \mathbf{e}_1 + v_2 \mathbf{e}_2 + v_3 \mathbf{e}_3, \qquad (18.1)$$

where $\{v_1, v_2, v_3\}$ are the components of v in the coordinate system C. Similarly v can be
expanded in the form

$$\mathbf{v} = v'_1 \mathbf{e}'_1 + v'_2 \mathbf{e}'_2 + v'_3 \mathbf{e}'_3, \qquad (18.2)$$

where $\{v'_1, v'_2, v'_3\}$ are the components of v in the coordinate system C'. These two sets of
components are related to each other. On taking the scalar product of the equality

$$v'_1 \mathbf{e}'_1 + v'_2 \mathbf{e}'_2 + v'_3 \mathbf{e}'_3 = v_1 \mathbf{e}_1 + v_2 \mathbf{e}_2 + v_3 \mathbf{e}_3 \qquad (18.3)$$


* Don’t spread this information around! 
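successively with $\mathbf{e}'_1$, $\mathbf{e}'_2$ and $\mathbf{e}'_3$ we find that

$$v'_1 = (\mathbf{e}'_1 \cdot \mathbf{e}_1)\, v_1 + (\mathbf{e}'_1 \cdot \mathbf{e}_2)\, v_2 + (\mathbf{e}'_1 \cdot \mathbf{e}_3)\, v_3,$$

$$v'_2 = (\mathbf{e}'_2 \cdot \mathbf{e}_1)\, v_1 + (\mathbf{e}'_2 \cdot \mathbf{e}_2)\, v_2 + (\mathbf{e}'_2 \cdot \mathbf{e}_3)\, v_3,$$

$$v'_3 = (\mathbf{e}'_3 \cdot \mathbf{e}_1)\, v_1 + (\mathbf{e}'_3 \cdot \mathbf{e}_2)\, v_2 + (\mathbf{e}'_3 \cdot \mathbf{e}_3)\, v_3,$$

which can be written in the matrix form

$$\mathbf{v}' = \mathbf{A} \cdot \mathbf{v} \qquad (18.4)$$

where

$$\mathbf{v}' = \begin{pmatrix} v'_1 \\ v'_2 \\ v'_3 \end{pmatrix}, \quad
\mathbf{v} = \begin{pmatrix} v_1 \\ v_2 \\ v_3 \end{pmatrix}, \quad
\mathbf{A} = \begin{pmatrix}
\mathbf{e}'_1 \cdot \mathbf{e}_1 & \mathbf{e}'_1 \cdot \mathbf{e}_2 & \mathbf{e}'_1 \cdot \mathbf{e}_3 \\
\mathbf{e}'_2 \cdot \mathbf{e}_1 & \mathbf{e}'_2 \cdot \mathbf{e}_2 & \mathbf{e}'_2 \cdot \mathbf{e}_3 \\
\mathbf{e}'_3 \cdot \mathbf{e}_1 & \mathbf{e}'_3 \cdot \mathbf{e}_2 & \mathbf{e}'_3 \cdot \mathbf{e}_3
\end{pmatrix}, \qquad (18.5)$$

and the ‘$\cdot$’ means the matrix product. This is the required relationship between the
components of v in the two coordinate systems. The elements of the transformation
matrix A depend only on the orientation of the system C' relative to the system C.

The inverse relationship to (18.4) can be found by taking the scalar product of equation
(18.3) successively with $\mathbf{e}_1$, $\mathbf{e}_2$ and $\mathbf{e}_3$. The result is that

$$v_1 = (\mathbf{e}_1 \cdot \mathbf{e}'_1)\, v'_1 + (\mathbf{e}_1 \cdot \mathbf{e}'_2)\, v'_2 + (\mathbf{e}_1 \cdot \mathbf{e}'_3)\, v'_3,$$

$$v_2 = (\mathbf{e}_2 \cdot \mathbf{e}'_1)\, v'_1 + (\mathbf{e}_2 \cdot \mathbf{e}'_2)\, v'_2 + (\mathbf{e}_2 \cdot \mathbf{e}'_3)\, v'_3,$$

$$v_3 = (\mathbf{e}_3 \cdot \mathbf{e}'_1)\, v'_1 + (\mathbf{e}_3 \cdot \mathbf{e}'_2)\, v'_2 + (\mathbf{e}_3 \cdot \mathbf{e}'_3)\, v'_3,$$

which can be written in the matrix form

$$\mathbf{v} = \mathbf{A}^T \cdot \mathbf{v}' \qquad (18.6)$$

where $\mathbf{A}^T$ is the transpose of A. If we now substitute equation (18.4) into equation (18.6)
we obtain

$$\mathbf{v} = \mathbf{A}^T \cdot (\mathbf{A} \cdot \mathbf{v}) = (\mathbf{A}^T \cdot \mathbf{A}) \cdot \mathbf{v}.$$

Since this formula holds for every choice of v, it follows that $\mathbf{A}^T \cdot \mathbf{A} = \mathbf{1}$, where 1 is the
identity matrix. This means that the transformation matrix A has the special property that
$\mathbf{A}^T = \mathbf{A}^{-1}$. Such matrices are called orthogonal.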

Definition 18.1 Orthogonal matrix A matrix having the property that its transpose is
also its inverse, that is,

$$\mathbf{A}^T \cdot \mathbf{A} = \mathbf{A} \cdot \mathbf{A}^T = \mathbf{1}, \qquad (18.7)$$

is called an orthogonal matrix.
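
As a numerical illustration of equations (18.4) to (18.7), the following sketch builds A from the scalar products of the two sets of unit vectors and checks that it is orthogonal. It is a minimal sketch using NumPy; the function name `transformation_matrix` and the particular primed basis are assumptions made for the example, not taken from the text.

```python
import numpy as np

def transformation_matrix(e, e_prime):
    """A[i, j] = e'_i . e_j, as in equation (18.5).

    Rows of `e` and `e_prime` hold the unit vectors of C and C',
    each expressed in one common Cartesian system.
    """
    return e_prime @ e.T

# C: the standard basis. C': an illustrative orthonormal basis (assumed).
e = np.eye(3)
e_prime = np.array([[1.0, 1.0, 0.0],
                    [-1.0, 1.0, 0.0],
                    [0.0, 0.0, np.sqrt(2.0)]]) / np.sqrt(2.0)

A = transformation_matrix(e, e_prime)
v = np.array([3.0, -4.0, 2.0])      # components in C
v_prime = A @ v                     # components in C', equation (18.4)

print(np.allclose(A.T @ A, np.eye(3)))   # orthogonality, equation (18.7)
print(np.allclose(A.T @ v_prime, v))     # inverse relation, equation (18.6)
```

Both checks should print `True` for any pair of orthonormal bases, since the argument in the text does not depend on the particular orientation of C'.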
FIGURE 18.2 Left: The coordinate system $C' = Ox'_1x'_2x'_3$ is obtained by rotating the
coordinate system $C = Ox_1x_2x_3$ through an angle $\psi$ about the axis $Ox_3$. Right: The
unit vector $\mathbf{e}'$ is obtained by rotating the unit vector $\mathbf{e}$ through an angle $\psi$ about the
general axis $\{O, \mathbf{n}\}$.


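Example 18.2 Rotation of C about a coordinate axis

The coordinate system C' is obtained by rotating the coordinate system C through an
angle of 30° about the axis $Ox_1$. Find the transformation matrix A between C and C'.
A vector v has components $\{3, -4, 2\}$ in C. What are its components in C'?

Solution

The formula corresponding to (18.8) when C is rotated through an angle $\psi$ about the
axis $Ox_1$ is

$$\mathbf{A} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \cos\psi & \sin\psi \\ 0 & -\sin\psi & \cos\psi \end{pmatrix}.$$

Hence, when $\psi = 30°$, the transformation matrix is

$$\mathbf{A} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & \sqrt{3}/2 & 1/2 \\ 0 & -1/2 & \sqrt{3}/2 \end{pmatrix}.$$

The components of the vector v in C' are then given by the elements of the column
vector $\mathbf{v}'$, where

$$\mathbf{v}' = \mathbf{A} \cdot \begin{pmatrix} 3 \\ -4 \\ 2 \end{pmatrix}
= \begin{pmatrix} 1 & 0 & 0 \\ 0 & \sqrt{3}/2 & 1/2 \\ 0 & -1/2 & \sqrt{3}/2 \end{pmatrix}
\begin{pmatrix} 3 \\ -4 \\ 2 \end{pmatrix}
= \begin{pmatrix} 3 \\ 1 - 2\sqrt{3} \\ 2 + \sqrt{3} \end{pmatrix}.$$

The components of the vector v in C' are therefore $\{3,\ 1 - 2\sqrt{3},\ 2 + \sqrt{3}\}$. ■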
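
The arithmetic of Example 18.2 can be confirmed with a few lines of NumPy. This is a sketch only: it assumes the vector has components {3, −4, 2} in C (the value consistent with the stated answer) and the function name `rotation_about_x1` is introduced here for illustration.

```python
import numpy as np

def rotation_about_x1(psi):
    """Transformation matrix when C is rotated through psi about Ox1."""
    c, s = np.cos(psi), np.sin(psi)
    return np.array([[1.0, 0.0, 0.0],
                     [0.0,   c,   s],
                     [0.0,  -s,   c]])

A = rotation_about_x1(np.radians(30.0))
v = np.array([3.0, -4.0, 2.0])       # components of v in C
v_prime = A @ v                      # components of v in C'

expected = np.array([3.0, 1.0 - 2.0 * np.sqrt(3.0), 2.0 + np.sqrt(3.0)])
print(v_prime, np.allclose(v_prime, expected))
```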

In the general case, the rotation of C can take place about an axis {O, n}, where n is 
any unit vector. The argument is messier but we can still find an explicit formula for A. 
C to C' is



The components of the vector v in C' are then given by the elements of the column
vector $\mathbf{v}'$, where $\mathbf{v}' = \mathbf{A} \cdot \mathbf{v}$.

The components of the vector v in C' are therefore $\{1, -11, 2\}$. ■


Reflections 

The coordinate system C' may also be defined by reflecting C in a plane through O. For
example, if C is reflected in the coordinate plane $Ox_1x_2$, the transformation matrix is
easily found to be

$$\mathbf{A} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \end{pmatrix}.$$

In the general case, if C is reflected in the plane through O with unit normal vector n, then
the transformation matrix is

$$\mathbf{A} = \begin{pmatrix}
1 - 2n_1^2 & -2n_1n_2 & -2n_1n_3 \\
-2n_2n_1 & 1 - 2n_2^2 & -2n_2n_3 \\
-2n_3n_1 & -2n_3n_2 & 1 - 2n_3^2
\end{pmatrix}, \qquad (18.11)$$

where $n_1$, $n_2$, $n_3$ are the components of the vector n in C. [The method of proof is similar
to that for the general rotation.]


Example 18.4 Reflection of axes

The coordinate system C' is obtained by reflecting the coordinate system C in the
plane $x_3 = 2x_1 + 2x_2$. Find the transformation matrix A between C and C'.

Find also the transformation matrix when C' is obtained from C by performing
the above reflection followed by a rotation of 90° about the new $x_3$-axis.

Solution

The equation $x_3 = 2x_1 + 2x_2$ can be written in the form

$$(x_1\mathbf{e}_1 + x_2\mathbf{e}_2 + x_3\mathbf{e}_3) \cdot (2\mathbf{e}_1 + 2\mathbf{e}_2 - \mathbf{e}_3) = 0$$

and, by comparison with the standard vector equation $\mathbf{r} \cdot \mathbf{n} = p$ of a plane, we see
that the unit vector

$$\mathbf{n} = \tfrac{2}{3}\mathbf{e}_1 + \tfrac{2}{3}\mathbf{e}_2 - \tfrac{1}{3}\mathbf{e}_3$$

is normal to the reflection plane. Hence $n_1 = 2/3$, $n_2 = 2/3$ and $n_3 = -1/3$. It
follows from the formula (18.11) that the transformation matrix between C and C' is

$$\mathbf{A} = \frac{1}{9}\begin{pmatrix} 1 & -8 & 4 \\ -8 & 1 & 4 \\ 4 & 4 & 7 \end{pmatrix}.$$

If this transformation is followed by a rotation whose transformation matrix is

$$\begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

then the overall transformation has matrix

$$\begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix} \cdot
\frac{1}{9}\begin{pmatrix} 1 & -8 & 4 \\ -8 & 1 & 4 \\ 4 & 4 & 7 \end{pmatrix}
= \frac{1}{9}\begin{pmatrix} -8 & 1 & 4 \\ -1 & 8 & -4 \\ 4 & 4 & 7 \end{pmatrix}.$$
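
The reflection formula (18.11) is the matrix 1 − 2nnᵀ, which makes Example 18.4 easy to check numerically. The sketch below assumes that form; the function name `reflection_matrix` is introduced here for illustration, and the normal vector is normalised inside the function so that any vector along the normal may be supplied.

```python
import numpy as np

def reflection_matrix(n):
    """Equation (18.11) written as A = 1 - 2 n n^T for a unit normal n."""
    n = np.asarray(n, dtype=float)
    n = n / np.linalg.norm(n)        # normalise, in case n is not a unit vector
    return np.eye(3) - 2.0 * np.outer(n, n)

A_reflect = reflection_matrix([2.0, 2.0, -1.0])   # plane x3 = 2 x1 + 2 x2
R = np.array([[0.0, 1.0, 0.0],       # rotation of 90 degrees about the x3-axis
              [-1.0, 0.0, 0.0],
              [0.0, 0.0, 1.0]])
A_total = R @ A_reflect              # reflection first, then rotation

print(np.round(9 * A_reflect))       # nine times the reflection matrix
print(np.round(9 * A_total))         # nine times the overall matrix
```

Note the order of the product: the transformation performed first stands on the right, so the overall matrix is the rotation matrix times the reflection matrix.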


It can be proved that the most general orthogonal transformation corresponds either 
to (i) a rotation of coordinates, or (ii) a reflection followed by a rotation. Which of these 
classes a particular orthogonal transformation belongs to can be decided by examining the 
sign of the determinant det A. 

If we take determinants of equation (18.7) we obtain

$$\det \mathbf{1} = \det\left(\mathbf{A}^T \cdot \mathbf{A}\right) = \det \mathbf{A}^T \times \det \mathbf{A} = \det \mathbf{A} \times \det \mathbf{A} = (\det \mathbf{A})^2,$$

from which it follows that det A can only take the values ±1. Surprisingly, we can work
out the determinant of the general A defined by equation (18.5) quite easily. The first row
of A contains the three components of the vector $\mathbf{e}'_1$ in the coordinate system C. Likewise,
the second and third rows contain the components of the vectors $\mathbf{e}'_2$ and $\mathbf{e}'_3$. It follows from
the vector formula (1.7) that, provided C is a right-handed coordinate system,

$$\det \mathbf{A} = (\mathbf{e}'_1 \times \mathbf{e}'_2) \cdot \mathbf{e}'_3.$$

We can now see how the different signs for det A can occur. If C' is also a right-handed
coordinate system, then the triple scalar product $(\mathbf{e}'_1 \times \mathbf{e}'_2) \cdot \mathbf{e}'_3 = +1$, whereas if C' is a
left-handed system then $(\mathbf{e}'_1 \times \mathbf{e}'_2) \cdot \mathbf{e}'_3 = -1$. If two coordinate systems have the same
handedness, then one can be made coincident with the other by a suitable rotation about
an axis through O. If they have opposite handedness then a reflection in a plane through
O is also needed. For this reason, orthogonal matrices with determinant +1 are sometimes
called rotation matrices.
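
The determinant test is simple to apply in practice. The sketch below is an illustration only: the function name `classify_orthogonal` and the tolerance are assumptions, and the two sample matrices are the 90° rotation about the $x_3$-axis and the reflection in the plane $x_3 = 0$.

```python
import numpy as np

def classify_orthogonal(A, tol=1e-9):
    """Return 'rotation' if det A = +1, 'reflection + rotation' if det A = -1."""
    A = np.asarray(A, dtype=float)
    if not np.allclose(A.T @ A, np.eye(3), atol=tol):
        raise ValueError("matrix is not orthogonal")
    return "rotation" if np.linalg.det(A) > 0 else "reflection + rotation"

def triple_product_of_rows(A):
    """(e'_1 x e'_2) . e'_3, using the rows of A as the components of e'_i in C."""
    return float(np.dot(np.cross(A[0], A[1]), A[2]))

rotation = np.array([[0.0, 1.0, 0.0], [-1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])
reflection = np.diag([1.0, 1.0, -1.0])

for A in (rotation, reflection, rotation @ reflection):
    print(classify_orthogonal(A), triple_product_of_rows(A), np.linalg.det(A))
```

In each case the triple scalar product of the rows agrees with the determinant, as the argument in the text requires.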


18.3 SCALARS, VECTORS AND TENSORS 

Our view of a vector as a quantity that has magnitude and direction has served 
us well. By picturing a vector as a line segment with an arrow on it, we have been able 
to understand essentially difficult concepts, such as acceleration and angular momentum,
far more easily. Neither of these quantities has any direct connection with line segments, 
but the picture is a valuable aid nonetheless. However, this notion of vectors cannot be 
generalised to tensors. This is the main reason we will now look at vectors (and even 
scalars) in a different light. 

Scalars 

We begin by giving the true definition of a scalar. 

Definition 18.2 Scalar Let $\phi$ be a real number defined in each coordinate system.*
Then $\phi$ is said to be a scalar if it has the same value in every coordinate system. That is,
for any pair of coordinate systems C and C',


Definition of a scalar

$$\phi' = \phi. \qquad (18.12)$$


This can be alternatively expressed by saying that a scalar is invariant under change of 
coordinate system. 

It is this invariance under change of coordinate system that is the essential feature of 
a scalar. Thus the mass of a particle and the length of a line are scalars. However, the 
sum of the coordinates of a given point of space (which is a single real quantity defined in 
each coordinate system) is not a scalar since it is not invariant under change of coordinate 
system. 

Vectors 

We turn now to the definition of a vector in terms of its components. The individual
components of a vector are certainly not invariants. They transform according to the
transformation formula (18.4). It is this transformation formula that becomes our new
definition of a vector.

Definition 18.3 Vector Let $\{v_1, v_2, v_3\}$ be a set of three real numbers defined in each
coordinate system. Then $\{v_1, v_2, v_3\}$ are said to be the components of a vector if their values
in any pair of coordinate systems C and C' are related by the transformation formula


Definition of a vector (matrix form)

$$\mathbf{v}' = \mathbf{A} \cdot \mathbf{v} \qquad (18.13)$$

where A is the transformation matrix between C and C'.


* These are rectangular, Cartesian coordinate systems with common origin O . 
This definition of a vector is clearly consistent with our previous point of view. Note
that a set of three real numbers defined in each coordinate system does not in general
constitute a vector; the components must be related to each other by the transformation
formula (18.13). For example, the Cartesian coordinates $(b_1, b_2, b_3)$ of a particle
B are a set of three real numbers defined in each coordinate system and these
numbers are related by the transformation formula (18.13). Thus $\{b_1, b_2, b_3\}$ is a
vector. Suppose now that B is a moving particle with time dependent coordinates
$(b_1(t), b_2(t), b_3(t))$. Then, at each time t, $\{\dot{b}_1(t), \dot{b}_2(t), \dot{b}_3(t)\}$ is also a set of three
real numbers defined in each coordinate system. The transformation formula (18.13)
is again satisfied since the matrix A does not depend upon the time. Hence, at each
time t, $\{\dot{b}_1(t), \dot{b}_2(t), \dot{b}_3(t)\}$ is a vector. This is, of course, the velocity vector of the
particle B at time t, and a second differentiation yields the acceleration vector of B.
In contrast, $\{2b_1, b_2, b_3\}$ is a set of three real numbers defined in each coordinate system,
but the transformation formula is not satisfied. Therefore $\{2b_1, b_2, b_3\}$ is not a
vector.
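
The failure of the triple {2b₁, b₂, b₃} to transform as a vector can be seen in a small numerical experiment. The sketch below is illustrative only: it assumes a rotation of 90° about the $x_3$-axis with the transformation matrix used elsewhere in this chapter, and the coordinates of B are made-up values.

```python
import numpy as np

def rotation_about_x3(psi):
    """Transformation matrix when C is rotated through psi about Ox3 (assumed form)."""
    c, s = np.cos(psi), np.sin(psi)
    return np.array([[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]])

def doubled_first(b):
    """The rule {2 b1, b2, b3}, applied to coordinates measured in one system."""
    return np.array([2.0 * b[0], b[1], b[2]])

A = rotation_about_x3(np.radians(90.0))
b = np.array([1.0, 2.0, 3.0])        # coordinates of B in C (illustrative)
b_prime = A @ b                      # coordinates of B in C', by (18.13)

in_C_prime = doubled_first(b_prime)  # the triple as actually defined in C'
transformed = A @ doubled_first(b)   # what (18.13) would require

print(in_C_prime, transformed, np.allclose(in_C_prime, transformed))
```

The two triples disagree, so the rule "double the first coordinate" does not define a vector, even though it produces three real numbers in every coordinate system.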


Tensors 

In order to generalise our definition of a vector, it is necessary to write the vector transformation
formula (18.13) in the following suffix form:


Definition of a vector (suffix form)

$$v'_i = \sum_{j=1}^{3} a_{ij}\, v_j \qquad (18.14)$$

$(1 \le i \le 3)$, where $a_{pq}$ is the element in the p-th row and q-th column of the
matrix A. The matrix product in equation (18.13) is equivalent to the summation in
equation (18.14).

Defined in the above way, vectors can be regarded as part of a hierarchy of entities
called tensors with a tensor of order n having $3^n$ components. A tensor of order one,
which has three components, is a vector; a tensor of order zero, which has one component,
is a scalar. The general definition of tensors looks extremely daunting when first
encountered. Below we give the definitions of second and third order tensors, from which
the general case can be inferred.

Definition 18.4 Second order tensor Let $\{t_{ij}\}$ $(1 \le i, j \le 3)$ be a set of nine real
numbers defined in each coordinate system. Then the $\{t_{ij}\}$ are said to be the components
of a second order tensor if their values in any pair of coordinate systems C and C' are
related by the transformation formula
Second order tensors in matrix form

In the general case it is not possible to give any simpler form for the tensor transformation
formula. However, for the special case of tensors of the second order (which is what we
are mainly concerned with), the transformation formula can be written in a more user-friendly
form in terms of matrix products.

In each coordinate system, a second order tensor $\{t_{ij}\}$ has nine components, indexed
by the integers i, j $(1 \le i, j \le 3)$. It is natural then to display these components as a 3 x 3
array, that is, as the elements of a 3 x 3 matrix. The tensor $\{t_{ij}\}$ can then be regarded as a
3 x 3 matrix

$$\mathbf{T} = \begin{pmatrix} t_{11} & t_{12} & t_{13} \\ t_{21} & t_{22} & t_{23} \\ t_{31} & t_{32} & t_{33} \end{pmatrix}$$

defined in each coordinate system. Also, the tensor transformation rule (18.15) can be
written in the form

$$t'_{ij} = \sum_{k=1}^{3} \sum_{l=1}^{3} a_{ik}\, a_{jl}\, t_{kl}
= \sum_{l=1}^{3} \left( \sum_{k=1}^{3} a_{ik}\, t_{kl} \right) a_{jl}
= \sum_{l=1}^{3} \left( \sum_{k=1}^{3} a_{ik}\, t_{kl} \right) a^T_{lj},$$

where the $\{a^T_{lj}\}$ are the elements of the transposed matrix $\mathbf{A}^T$. The summations in this last
expression are equivalent to matrix products and so the transformation rule for a second
order tensor can be expressed in the form

$$\mathbf{T}' = \mathbf{A} \cdot \mathbf{T} \cdot \mathbf{A}^T.$$

This gives the following alternative definition:

Definition 18.6 Second order tensor (matrix form) Let T be a 3 x 3 matrix defined
in each coordinate system. Then T is said to represent a second order tensor if its values
in any pair of coordinate systems C and C' are related by the transformation formula


Definition of a second order tensor (matrix form)

$$\mathbf{T}' = \mathbf{A} \cdot \mathbf{T} \cdot \mathbf{A}^T \qquad (18.17)$$

where A is the transformation matrix between C and C'.

This definition has the virtue of being expressed in terms of the familiar operation of
matrix multiplication. It only applies to second order tensors but that is exactly what we
need. The inertia tensor is second order, as are many other important tensors of physics.
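
The equivalence of the suffix form and the matrix form can be checked numerically. The following sketch assumes the double-sum rule $t'_{ij} = \sum_k \sum_l a_{ik} a_{jl} t_{kl}$ for second order tensors; the function names are hypothetical, the tensor components are random numbers, and the orthogonal matrix is generated by a QR factorisation purely for convenience.

```python
import numpy as np

def transform_suffix(A, T):
    """t'_ij = sum_k sum_l a_ik a_jl t_kl, the suffix form."""
    return np.einsum("ik,jl,kl->ij", A, A, T)

def transform_matrix(A, T):
    """T' = A . T . A^T, the matrix form (18.17)."""
    return A @ T @ A.T

rng = np.random.default_rng(0)
T = rng.normal(size=(3, 3))                    # arbitrary components in C
A, _ = np.linalg.qr(rng.normal(size=(3, 3)))   # an arbitrary orthogonal matrix

print(np.allclose(transform_suffix(A, T), transform_matrix(A, T)))
```

The agreement does not rely on A being orthogonal; it is simply the statement that the double sum is a pair of matrix products.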
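Example 18.5 Transforming a second order tensor


In the coordinate system C, a second order tensor has the components $t_{11} = 1$, $t_{12} = 2$,
$t_{13} = 3$, $t_{21} = 4$, $t_{22} = 5$, $t_{23} = 6$, $t_{31} = 7$, $t_{32} = 8$, $t_{33} = 9$. Find the components of
the tensor in C' when

(i) C' is obtained from C by a rotation of 90° about the axis $Ox_3$, and

(ii) C' is obtained from C by a reflection in the plane $x_3 = 0$.


Solution

In the coordinate system C, the matrix of components is given to be

$$\mathbf{T} = \begin{pmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \end{pmatrix}.$$

Part (i). From the formula (18.8), the transformation matrix of the required rotation
of coordinates is

$$\mathbf{A} = \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}.$$

The transformation formula (18.17) then implies that, in the coordinate system C', the
tensor is represented by the matrix

$$\mathbf{T}' = \mathbf{A} \cdot \mathbf{T} \cdot \mathbf{A}^T
= \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}
\begin{pmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \end{pmatrix}
\begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}
= \begin{pmatrix} 5 & -4 & 6 \\ -2 & 1 & -3 \\ 8 & -7 & 9 \end{pmatrix}.$$

Hence the components of the tensor in C' are $t'_{11} = 5$, $t'_{12} = -4$, $t'_{13} = 6$, $t'_{21} = -2$,
$t'_{22} = 1$, $t'_{23} = -3$, $t'_{31} = 8$, $t'_{32} = -7$, $t'_{33} = 9$.

Part (ii). The transformation matrix of the required reflection of coordinates is

$$\mathbf{A} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & -1 \end{pmatrix}.$$

The transformation formula (18.17) then implies that

$$\mathbf{T}' = \mathbf{A} \cdot \mathbf{T} \cdot \mathbf{A}^T = \begin{pmatrix} 1 & 2 & -3 \\ 4 & 5 & -6 \\ -7 & -8 & 9 \end{pmatrix}.$$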
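
Both parts of Example 18.5 can be checked with NumPy. This is a minimal sketch that assumes the matrix form T' = A · T · Aᵀ together with the rotation matrix for 90° about $Ox_3$ and the reflection matrix for the plane $x_3 = 0$ as used in this chapter; the variable names are chosen for illustration.

```python
import numpy as np

T = np.arange(1.0, 10.0).reshape(3, 3)   # t11 = 1, t12 = 2, ..., t33 = 9

A_rot = np.array([[0.0, 1.0, 0.0],       # rotation of 90 degrees about Ox3
                  [-1.0, 0.0, 0.0],
                  [0.0, 0.0, 1.0]])
A_ref = np.diag([1.0, 1.0, -1.0])        # reflection in the plane x3 = 0

T_rot = A_rot @ T @ A_rot.T              # part (i)
T_ref = A_ref @ T @ A_ref.T              # part (ii)
print(T_rot)
print(T_ref)
```

The reflection changes the sign of exactly those components that carry a single suffix 3, which is what one would expect from the suffix form of the transformation rule.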
Another important application of contraction is as follows. Suppose that $\{t_{ij}\}$ is a
second order tensor and that $\{u_i\}$ is a vector. Then the outer product $\{t_{ij}u_k\}$ is a third order
tensor. If we now contract this third order tensor with respect to the suffix pair j, k, we
obtain the vector $\{v_i\}$ given by

$$v_i = \sum_{j=1}^{3} t_{ij}\, u_j.$$

This formula looks nice in the matrix formulation. If $\{t_{ij}\}$ is represented by the matrix T
and $\{u_i\}$ by the column vector u, then $\{v_i\}$ is represented by the column vector v given by

$$\mathbf{v} = \mathbf{T} \cdot \mathbf{u}.$$

In other words, if the matrix T represents a second order tensor and the column vector
u represents a vector, then the product T · u represents a vector. Thus pre-multiplication
by T transforms one vector into another vector. This is a common way in which tensors
act in physics. For example, in crystalline materials, the electric vectors E and D are not
parallel. In each coordinate system, they are related by the formula

$$\mathbf{D} = \mathbf{K} \cdot \mathbf{E},$$

where the 3 x 3 matrix K represents the dielectric tensor, the crystalline equivalent of
the dielectric constant. This tensor relationship between D and E is the cause of double
refraction in crystalline materials. The inertia tensor has a similar role, transforming the
angular velocity vector into the angular momentum vector.

The angular velocity vector 

We have already shown that the velocity of a particle is a vector in the sense of the definition
(18.13), but we have so far said nothing about the transformation properties of the
angular velocity of a rigid body. This is made more awkward to decide by the fact that $\boldsymbol{\omega}$
is defined indirectly. Suppose that one of the particles of a rigid body is fixed at the origin
O. Then $\boldsymbol{\omega}$ is essentially defined to be that ‘vector’ that gives the velocities of the particles
of the body by the formula $\mathbf{v} = \boldsymbol{\omega} \times \mathbf{r}$. In each coordinate system this formula can also be
expressed by the matrix product

$$\mathbf{v} = \boldsymbol{\Omega} \cdot \mathbf{x},$$

where

$$\mathbf{v} = \begin{pmatrix} v_1 \\ v_2 \\ v_3 \end{pmatrix}, \quad
\boldsymbol{\Omega} = \begin{pmatrix} 0 & -\omega_3 & \omega_2 \\ \omega_3 & 0 & -\omega_1 \\ -\omega_2 & \omega_1 & 0 \end{pmatrix}, \quad
\mathbf{x} = \begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix}.$$

[Check this.] Now $\{x_i\}$ and $\{v_i\}$ are known to transform as vectors, but nothing is known
about the transformation properties of the matrix $\boldsymbol{\Omega}$. However, it can be proved that if
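Definition 18.7 Inertia tensor The second order tensor $\{I_{ij}\}$ defined by


Definition of the inertia tensor

$$I_{ij} = \sum m \left( \left(x_1^2 + x_2^2 + x_3^2\right) \delta_{ij} - x_i x_j \right) \qquad (18.22)$$

is called the inertia tensor of the body $\mathcal{B}$ at the point O. Note that the inertia tensor is
symmetric, that is, $I_{ji} = I_{ij}$.

We have therefore proved that, when a rigid body is rotating with angular velocity $\boldsymbol{\omega}$
about an axis through origin O, its angular momentum $\mathbf{L}_O$ about O is given by

$$\begin{pmatrix} L_1 \\ L_2 \\ L_3 \end{pmatrix} =
\begin{pmatrix} I_{11} & I_{12} & I_{13} \\ I_{21} & I_{22} & I_{23} \\ I_{31} & I_{32} & I_{33} \end{pmatrix}
\begin{pmatrix} \omega_1 \\ \omega_2 \\ \omega_3 \end{pmatrix}. \qquad (18.23)$$

This can be written in matrix form as


Angular momentum formula

$$\mathbf{L}_O = \mathbf{I} \cdot \boldsymbol{\omega} \qquad (18.24)$$

where the column vectors $\mathbf{L}_O$ and $\boldsymbol{\omega}$ are formed from the components of $\mathbf{L}_O$ and $\boldsymbol{\omega}$ respectively,
and I is the matrix form of the inertia tensor $\{I_{ij}\}$. Because $\mathbf{L}_O$ is obtained from
$\boldsymbol{\omega}$ by a matrix multiplication, it follows that $\mathbf{L}_O$ and $\boldsymbol{\omega}$ will generally lie in different
directions. This fact (which does not show itself in planar mechanics) is what gives three-dimensional
rigid body motion its special character.

The inertia tensor also appears in the corresponding expression for the kinetic energy
of $\mathcal{B}$. By following a similar procedure to that used for angular momentum, it is found
that T is given by

$$T = \tfrac{1}{2} \begin{pmatrix} \omega_1 & \omega_2 & \omega_3 \end{pmatrix}
\begin{pmatrix} I_{11} & I_{12} & I_{13} \\ I_{21} & I_{22} & I_{23} \\ I_{31} & I_{32} & I_{33} \end{pmatrix}
\begin{pmatrix} \omega_1 \\ \omega_2 \\ \omega_3 \end{pmatrix}, \qquad (18.25)$$

which can be written in the matrix form


Kinetic energy formula

$$T = \tfrac{1}{2}\, \boldsymbol{\omega}^T \cdot \mathbf{I} \cdot \boldsymbol{\omega} \qquad (18.26)$$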
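
The two formulae can be evaluated directly once the matrix I is known. The sketch below is illustrative: the function names are hypothetical, and the symmetric matrix and angular velocity are made-up values chosen so that the off-diagonal element $I_{12}$ is non-zero.

```python
import numpy as np

def angular_momentum(I, omega):
    """L_O = I . omega, equation (18.24)."""
    return I @ omega

def kinetic_energy(I, omega):
    """T = (1/2) omega^T . I . omega, equation (18.26)."""
    return 0.5 * omega @ I @ omega

# Illustrative symmetric inertia matrix and angular velocity (assumed values).
I = np.array([[2.0, -1.0, 0.0],
              [-1.0, 3.0, 0.0],
              [0.0, 0.0, 4.0]])
omega = np.array([1.0, 0.0, 0.0])

L = angular_momentum(I, omega)
print(L, kinetic_energy(I, omega))
print(np.allclose(np.cross(L, omega), 0.0))   # parallel only if the cross product vanishes
```

With these values the cross product of L and ω is non-zero, so the angular momentum is not parallel to the angular velocity even though the body rotates about a coordinate axis.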
The elements of the inertia tensor

The diagonal elements of $\mathsf{I}$ are actually familiar quantities. For example

$$
I_{11} = \sum m \left( (x_1^2 + x_2^2 + x_3^2)\,\delta_{11} - x_1 x_1 \right)
       = \sum m\,(x_2^2 + x_3^2)
       = \sum m\,p_1^2,
$$

where $p_1$ is the perpendicular distance of the typical particle P from the axis $Ox_1$. Hence
$I_{11}$ is just the ordinary moment of inertia of the body about the axis $Ox_1$. Similarly, $I_{22}$
and $I_{33}$ are the moments of inertia of the body about the axes $Ox_2$ and $Ox_3$ respectively.
The off-diagonal elements of $\mathsf{I}$ are given by

$$ I_{12} = I_{21} = -\sum m\,x_1 x_2, \tag{18.27} $$

$$ I_{23} = I_{32} = -\sum m\,x_2 x_3, \tag{18.28} $$

$$ I_{31} = I_{13} = -\sum m\,x_3 x_1, \tag{18.29} $$

where, as usual, the sum is taken over the particles of the body. The quantities $\sum m\,x_1 x_2$,
$\sum m\,x_2 x_3$, $\sum m\,x_3 x_1$ are known as products of inertia. Hence the off-diagonal elements
of $\mathsf{I}$ are the negatives of their corresponding products of inertia.* Our results are summarised
as follows:


Elements of the inertia tensor

• The diagonal elements of the inertia tensor, namely $I_{11}$, $I_{22}$, $I_{33}$, are the ordinary
moments of inertia of the body about the coordinate axes $Ox_1$, $Ox_2$, $Ox_3$
respectively.

• The off-diagonal elements of the inertia tensor are the negatives of their corresponding
products of inertia.
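These two statements can be checked on a small system of particles. The sketch below builds the inertia tensor element by element from the sums over particles and compares it with a moment of inertia and a product of inertia computed directly. The two particles and the function name `inertia_tensor` are illustrative assumptions, not taken from the text.

```python
def inertia_tensor(particles):
    """Inertia tensor at the origin for point masses given as (m, (x1, x2, x3))."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, x in particles:
        r2 = sum(c * c for c in x)
        for i in range(3):
            for j in range(3):
                delta = 1.0 if i == j else 0.0
                I[i][j] += m * (r2 * delta - x[i] * x[j])
    return I


# Hypothetical body made of two particles (arbitrary units).
particles = [(2.0, (1.0, 1.0, 0.0)), (1.0, (0.0, 2.0, 1.0))]
I = inertia_tensor(particles)

# Moment of inertia about Ox1 computed directly as the sum of m * (x2^2 + x3^2).
I11_direct = sum(m * (x[1] ** 2 + x[2] ** 2) for m, x in particles)

# Product of inertia: the sum of m * x1 * x2.
P12 = sum(m * x[0] * x[1] for m, x in particles)

print(I[0][0], I11_direct)   # diagonal element equals the moment of inertia
print(I[0][1], -P12)         # off-diagonal element equals minus the product of inertia
```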


Products of inertia are new quantities and the reader will probably expect that we
should now give several examples on their calculation. In fact, we will give just one easy 
one, which is a part of the next example. The reason is that, in almost all problems, 
the products of inertia can be inferred from symmetry and known results. Only a body 
with virtually no symmetry would require products of inertia to be calculated from first 
principles. 


* Some authors include the negative sign in their definition of product of inertia, but this seems unnatural. 
FIGURE 18.3 The uniform rectangular plate has mass M and sides 2a and 2b.


Example 18.6 Calculating the inertia tensor 

A uniform rectangular plate has mass M and sides 2a and 2b as shown in Figure 18.3.
Find the inertia tensor at the points G and C in the coordinate systems shown.

Solution

Consider first the centre of mass point G. Then $I_{11}$ is the moment of inertia of the
plate about the axis $Gx_1$, which is given in the Appendix (at the end of the book)
to be $\tfrac{1}{3}Mb^2$. Similarly, $I_{22} = \tfrac{1}{3}Ma^2$ and, by the perpendicular axes theorem,
$I_{33} = I_{11} + I_{22} = \tfrac{1}{3}M(a^2 + b^2)$. The products of inertia are all zero. The products
$\sum m\,x_2 x_3$ and $\sum m\,x_3 x_1$ are zero because all the mass lies in the plane $x_3 = 0$. The product
$\sum m\,x_1 x_2$ is zero because of symmetry; the plate is symmetrical about the axis $x_1 = 0$
but the terms of the sum are odd functions of $x_1$. The contributions to the sum from
the right and left halves of the plate therefore cancel. Hence $I_{23} = I_{31} = I_{12} = 0$.
Thus, in the given coordinate system, the inertia tensor at G is represented by the
matrix

$$
\mathsf{I}_G = \tfrac{1}{3} M
\begin{pmatrix}
b^2 & 0 & 0 \\
0 & a^2 & 0 \\
0 & 0 & a^2 + b^2
\end{pmatrix}
$$

Now consider the corner point C. Then $I_{11}$ is the moment of inertia of the
plate about the axis $Cx_1$, which is given by the parallel axes theorem to be
$\tfrac{1}{3}Mb^2 + Mb^2 = \tfrac{4}{3}Mb^2$. Similarly, $I_{22} = \tfrac{4}{3}Ma^2$ and, by the perpendicular axes theorem,
$I_{33} = I_{11} + I_{22} = \tfrac{4}{3}M(a^2 + b^2)$. The elements $I_{23}$ and $I_{31}$ are zero for the same
reason as before. However, the product $\sum m\,x_1 x_2$ is not zero now because the plate
is no longer symmetrically placed relative to the coordinate system based at C. We
will therefore evaluate this product of inertia from first principles. Let the plate have
uniform mass per unit area $\sigma$. Then

$$
\sum m\,x_1 x_2 = \int_{x_1=0}^{2a} \int_{x_2=0}^{2b} \sigma\, x_1 x_2 \, dx_1\, dx_2 = 4\sigma a^2 b^2 = Mab,
$$

since $M = 4\sigma ab$. Hence $I_{12} = -Mab$. Thus, in the given coordinate system, the
inertia tensor of the plate at C is represented by the matrix

$$
\mathsf{I}_C = \tfrac{1}{3} M
\begin{pmatrix}
4b^2 & -3ab & 0 \\
-3ab & 4a^2 & 0 \\
0 & 0 & 4(a^2 + b^2)
\end{pmatrix}. \qquad ■
$$
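The result at C can be cross-checked in a different way. The sketch below does not follow the route taken in the example; instead it assumes the tensor form of the parallel axes theorem, $I^C_{ij} = I^G_{ij} + M(d^2\delta_{ij} - d_i d_j)$, where $\mathbf{d} = (a, b, 0)$ is the position of G relative to C. This is a standard result, but it is brought in here as an assumption. Exact fractions are used so that the comparison is free of rounding, and the function name is hypothetical.

```python
from fractions import Fraction


def plate_tensor_at_corner(M, a, b):
    """Inertia tensor of the plate at C from the tensor at G (assumed parallel axes form)."""
    third = Fraction(1, 3)
    I_G = [[third * M * b * b, 0, 0],
           [0, third * M * a * a, 0],
           [0, 0, third * M * (a * a + b * b)]]
    d = (a, b, 0)                      # position of G relative to the corner C
    d2 = sum(c * c for c in d)
    return [[I_G[i][j] + M * (d2 * (1 if i == j else 0) - d[i] * d[j])
             for j in range(3)] for i in range(3)]


def plate_tensor_from_example(M, a, b):
    """The matrix quoted at the end of Example 18.6."""
    f = Fraction(M, 3)
    return [[f * 4 * b * b, f * (-3 * a * b), 0],
            [f * (-3 * a * b), f * 4 * a * a, 0],
            [0, 0, f * 4 * (a * a + b * b)]]


# Illustrative integer values for M, a and b.
print(plate_tensor_at_corner(3, 1, 2) == plate_tensor_from_example(3, 1, 2))
```

The agreement of the off-diagonal element shows in particular that $I_{12}$ at C carries a negative sign.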

Example 18.7 Transforming the inertia tensor 

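Find $\mathsf{I}_G$ in the set of axes obtained by rotating the axes $Gx_1x_2x_3$ about $Gx_3$ so that
the new axis $Gx_1$ lies along the diagonal GD.

Solution

In order to achieve the required position, the axes $Gx_1x_2x_3$ must be rotated through
the acute angle $\alpha = \tan^{-1}(b/a)$. The corresponding transformation matrix is given
by the formula (18.8) to be

$$
\mathsf{A} =
\begin{pmatrix}
\cos\alpha & \sin\alpha & 0 \\
-\sin\alpha & \cos\alpha & 0 \\
0 & 0 & 1
\end{pmatrix}
$$

The matrix $\mathsf{I}'_G$ representing the inertia tensor at G in the rotated coordinates is given
by the transformation formula (18.17) to be

$$
\begin{aligned}
\mathsf{I}'_G &= \mathsf{A} \cdot \mathsf{I}_G \cdot \mathsf{A}^T \\
&= \tfrac{1}{3} M
\begin{pmatrix}
\cos\alpha & \sin\alpha & 0 \\
-\sin\alpha & \cos\alpha & 0 \\
0 & 0 & 1
\end{pmatrix}
\begin{pmatrix}
b^2 & 0 & 0 \\
0 & a^2 & 0 \\
0 & 0 & a^2 + b^2
\end{pmatrix}
\begin{pmatrix}
\cos\alpha & -\sin\alpha & 0 \\
\sin\alpha & \cos\alpha & 0 \\
0 & 0 & 1
\end{pmatrix} \\
&= \tfrac{1}{3} M
\begin{pmatrix}
a^2\sin^2\alpha + b^2\cos^2\alpha & (a^2 - b^2)\sin\alpha\cos\alpha & 0 \\
(a^2 - b^2)\sin\alpha\cos\alpha & a^2\cos^2\alpha + b^2\sin^2\alpha & 0 \\
0 & 0 & a^2 + b^2
\end{pmatrix} \\
&= \frac{M}{3(a^2 + b^2)}
\begin{pmatrix}
2a^2b^2 & ab(a^2 - b^2) & 0 \\
ab(a^2 - b^2) & a^4 + b^4 & 0 \\
0 & 0 & (a^2 + b^2)^2
\end{pmatrix}
\end{aligned}
$$

on inserting the values $\sin\alpha = b/(a^2 + b^2)^{1/2}$ and $\cos\alpha = a/(a^2 + b^2)^{1/2}$. This is
the inertia tensor of the plate at G in the rotated coordinate system. ■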
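The matrix algebra in this example is easy to confirm numerically. The following sketch multiplies out $\mathsf{A}\cdot\mathsf{I}_G\cdot\mathsf{A}^T$ for sample values of M, a and b and compares the result with the closed form obtained above. The function names and the numerical values are illustrative assumptions.

```python
import math


def mat_mul(P, Q):
    """Product of two 3x3 matrices."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(P):
    return [[P[j][i] for j in range(3)] for i in range(3)]


def rotated_plate_tensor(M, a, b):
    """A . I_G . A^T for a rotation through alpha = atan(b/a) about Gx3."""
    alpha = math.atan2(b, a)
    c, s = math.cos(alpha), math.sin(alpha)
    A = [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]
    I_G = [[M * b * b / 3, 0.0, 0.0],
           [0.0, M * a * a / 3, 0.0],
           [0.0, 0.0, M * (a * a + b * b) / 3]]
    return mat_mul(mat_mul(A, I_G), transpose(A))


def closed_form(M, a, b):
    """The final matrix of Example 18.7."""
    f = M / (3 * (a * a + b * b))
    off = f * a * b * (a * a - b * b)
    return [[f * 2 * a * a * b * b, off, 0.0],
            [off, f * (a ** 4 + b ** 4), 0.0],
            [0.0, 0.0, f * (a * a + b * b) ** 2]]


numeric = rotated_plate_tensor(3.0, 2.0, 1.0)
exact = closed_form(3.0, 2.0, 1.0)
print(all(math.isclose(numeric[i][j], exact[i][j], abs_tol=1e-12)
          for i in range(3) for j in range(3)))
```

Note that the rotated matrix is no longer diagonal, so the rotated axes are not principal axes of the plate unless a = b.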


Example 18.8 Finding kinetic energy using the inertia tensor 

Suppose that the plate in Figure 18.3 is made to rotate about one of its diagonals with
angular speed $\lambda$. Find its kinetic energy.

Solution

The kinetic energy can be evaluated by using either the original or the rotated coordinate
system.
Expressions for $\mathbf{L}_G$ and $T_G$ in principal axes at G

$$
\begin{aligned}
\mathbf{L}_G &= (A\omega_1)\,\mathbf{e}_1 + (B\omega_2)\,\mathbf{e}_2 + (C\omega_3)\,\mathbf{e}_3 \\
T_G &= \tfrac{1}{2}\left(A\omega_1^2 + B\omega_2^2 + C\omega_3^2\right)
\end{aligned}
\tag{18.33}
$$

where G is the centre of mass of the body. Here $\mathbf{L}_G$ is the angular momentum of the body
about G, and $T_G$ is the rotational kinetic energy of the body about G.

These are the results that you really need in order to solve problems in rigid body
dynamics. Almost always, principal axes are used and only the equations (18.32) or
(18.33) are required. Thus, it is possible to solve most problems in rigid body dynamics
without any knowledge of tensors at all!

18.7 DYNAMICAL SYMMETRY 

Finding principal axes from geometrical symmetries of the body 

The directions of the principal axes of the inertia tensor of a body at a point O can often 
be inferred from the geometrical symmetry of the body about O. The following rules 
(which are consequences of the tensor transformation formula) are useful. 

Rule 1: If the body has reflective symmetry in a plane through O, then the line through 
O perpendicular to this plane is a principal axis. 

Rule 2: If the body has any rotational symmetry about an axis through O, then this axis 
is a principal axis. 

Either of these rules is enough to show that, for a uniform rectangular plate, the 
axes shown in Figure 18.3 (left) are a set of principal axes at G. The plate has reflective 
symmetry in each of the coordinate planes, and also has rotational symmetry of order two 
about each of the coordinate axes. The set of parallel axes at C is not a principal set. For 
a uniform cone on a pentagonal base (see Figure 18.4 (left)), the axis of symmetry is a 
principal axis at G and there are five other principal axes through G. [What are they?] For
a spinning top (see Figure 18.4 (centre)), the axis of symmetry is a principal axis at O 
and so is any axis through O that is perpendicular to it! We seem to be finding too many 
principal axes, but this will soon be explained. 

Dynamical symmetry 

Consider a rigid body B pivoted at the origin O. Then the motion of B under known forces
is determined by the angular momentum principle, which, in view of equation (18.24),
can be written

$$
\frac{d}{dt}\left( \mathsf{I}_O \cdot \mathsf{w} \right) = \mathsf{K}_O
$$

where $\mathsf{K}_O$ is the total moment about O of the external forces acting on B. In principal
axes, $\mathsf{I}_O$ contains only the three principal moments of inertia A, B and C and so the shape
of the body enters into the equations of motion only by the values of these three numbers.
Under the same set of forces and initial conditions, different bodies that have the same
principal moments of inertia have the same motion.

The form of the motion, and the chance of solving problems, is very much influenced 
by the presence or absence of dynamical symmetry. There are three cases: 

A, B, C all different

When A, B and C are all different, the body is said to be dynamically unsymmetric, and,
as might be expected, this is the most difficult case to treat analytically. For the rectangular
plate shown in Figure 18.3, the principal moments of inertia at G are $A = \tfrac{1}{3}Mb^2$,
$B = \tfrac{1}{3}Ma^2$ and $C = \tfrac{1}{3}M(a^2 + b^2)$, which are all different. Thus, despite the apparently simple
shape of the plate, it is no easier to treat than a lump of scrap metal. In the unsymmetrical
case, the principal axes at O are essentially unique.*

A and B equal, C different 

Suppose now that two of the principal moments of inertia (A and B say) are equal, the
third, C, being different. The body is then said to be dynamically axially symmetric
about the axis $Ox_3$. This time, the axis of symmetry is a principal axis, and the other
two axes can be chosen arbitrarily, provided that they form a right-handed orthogonal set.
One might expect this to occur only when the body is a body of revolution about an axis
through O, as in the case of a traditional spinning top. However, the following remarkable
result shows that this is not necessary.

Rule 3: If a body has a rotational symmetry about an axis through O, and the order† of
this symmetry is three or more, then the body has dynamical axial symmetry about this
axis.
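Rule 3 can be illustrated with a very simple body. The sketch below is an assumption-laden toy model: n equal point masses are placed at the corners of a regular n-sided polygon in the plane $x_3 = 0$, which gives a body with rotational symmetry of order n about $Ox_3$. The inertia tensor is built from the particle sums (18.27) to (18.29) and the diagonal sums; the function names are hypothetical.

```python
import math


def inertia_tensor(particles):
    """Inertia tensor at the origin for point masses given as (m, (x1, x2, x3))."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, x in particles:
        r2 = sum(c * c for c in x)
        for i in range(3):
            for j in range(3):
                delta = 1.0 if i == j else 0.0
                I[i][j] += m * (r2 * delta - x[i] * x[j])
    return I


def ring_of_masses(n, radius=1.0, mass=1.0):
    """n equal point masses evenly spaced on a circle in the plane x3 = 0."""
    return [(mass, (radius * math.cos(2 * math.pi * k / n),
                    radius * math.sin(2 * math.pi * k / n),
                    0.0)) for k in range(n)]


I5 = inertia_tensor(ring_of_masses(5))   # symmetry of order five
I2 = inertia_tensor(ring_of_masses(2))   # symmetry of order two only

print(I5[0][0], I5[1][1], I5[0][1])      # A and B agree, product of inertia vanishes
print(I2[0][0], I2[1][1])                # A and B differ
```

With five masses the two transverse moments of inertia agree to rounding error, as Rule 3 predicts. With only two masses they differ, which shows why the order of the symmetry must be at least three.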







* The principal axes are unique in the sense that the three orthogonal unlabelled lines in space are unique.
There are then 24 ways of labelling these lines to create a right-handed system of coordinates.

† If the mass distribution of a rigid body is left unchanged when the body is rotated through an angle $2\pi/n$,
then the body is said to have rotational symmetry of order n about the rotation axis. Thus a uniform
lamina having the shape of a regular pentagon has rotational symmetry of order five about the axis through
G perpendicular to its plane.
FIGURE 18.5 Three bodies with dynamical spherical symmetry.


The pentagonal cone in Figure 18.4 has rotational symmetry of order five and so Rule 3
applies. In particular, it follows that the cone has the same moment of inertia about any
axis through G perpendicular to the symmetry axis!

Interestingly, it is possible for a body to have dynamical axial symmetry and yet have 
no rotational symmetry at all. The body in Figure 18.4 (right) has no rotational symmetry, 
but, since it is made up of three bodies that do have such symmetry, it must be dynamically 
axially symmetric! 

A, B and C all equal 

Suppose now that all three of the principal moments of inertia at O are equal. The body is 
then said to be dynamically spherically symmetric about O. This time, the inertia tensor 
is a multiple of the identity and any axis through O is a principal axis. The principal axes 
can therefore be chosen arbitrarily, provided that they form a right-handed orthogonal set. 
A uniform ball is obviously dynamically spherically symmetric, but there are many other
examples. This follows from our last rule, Rule 4.

Rule 4: If a body has two different axes of dynamical axial symmetry through O, then it 
must be dynamically spherically symmetric about O. 

Figure 18.5 shows three bodies that have dynamical spherical symmetry about their 
centres of mass. None of these bodies is a ball, but they move as if they were. For 
example, the uniform tetrahedron has four different axes of dynamical axial symmetry 
through G and so, by Rule 4, it must be dynamically spherically symmetric about G. As 
a consequence, it must have the same moment of inertia about any axis through G. [Don’t
try proving this any other way!]
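A numerical check is a different matter from a proof, and it is quick to do. The sketch below replaces the uniform tetrahedron by a simpler stand-in, which is an assumption made for illustration: four equal point masses at the vertices of a regular tetrahedron centred on G. This body has the same rotational symmetries, so Rule 4 applies to it in the same way. The moment of inertia about a general axis is evaluated with the formula $\mathsf{n}^T \cdot \mathsf{I}_G \cdot \mathsf{n}$ of Problem 18.10.

```python
def inertia_tensor(particles):
    """Inertia tensor at the origin for point masses given as (m, (x1, x2, x3))."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, x in particles:
        r2 = sum(c * c for c in x)
        for i in range(3):
            for j in range(3):
                delta = 1.0 if i == j else 0.0
                I[i][j] += m * (r2 * delta - x[i] * x[j])
    return I


def moment_about_axis(I, n):
    """Moment of inertia n^T . I . n about the axis with unit vector n."""
    return sum(n[i] * I[i][j] * n[j] for i in range(3) for j in range(3))


# Vertices of a regular tetrahedron whose centre of mass is at the origin.
vertices = [(1.0, 1.0, 1.0), (1.0, -1.0, -1.0), (-1.0, 1.0, -1.0), (-1.0, -1.0, 1.0)]
particles = [(1.0, v) for v in vertices]   # unit masses, an arbitrary choice

I = inertia_tensor(particles)
print(I)                                       # a multiple of the identity matrix
print(moment_about_axis(I, (0.6, 0.0, 0.8)))   # same value for any unit vector n
```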
Problems on Chapter 18

Answers and comments are at the end of the book. 
Harder problems carry a star (*). 


Orthogonal transformations 

18.1 Show that the matrix 


A 




is orthogonal. If $\mathsf{A}$ is the transformation matrix between the coordinate systems C and C', do C
and C' have the same, or opposite, handedness?


18.2 Find the transformation matrix between the coordinate systems C and C' when C' is 
obtained 

(i) by rotating C through an angle of 45° about the axis $Ox_2$,

(ii) by reflecting C in the plane $x_2 = 0$,

(iii) by rotating C through a right angle about the axis OB, where B is the point with
coordinates (2, 2, 1),

(iv) by reflecting C in the plane $2x_1 - x_2 + 2x_3 = 0$.

In each case, find the new coordinates of the point D whose coordinates in C are (3, −3, 0).


18.3 Show that the matrix

$$
\mathsf{A} = \frac{1}{3}
\begin{pmatrix}
2 & -1 & -2 \\
2 & 2 & 1 \\
1 & -2 & 2
\end{pmatrix}
$$

is orthogonal and has determinant +1. Find the column vectors $\mathsf{V}$ that satisfy the equation
$\mathsf{A} \cdot \mathsf{V} = \mathsf{V}$. If $\mathsf{A}$ is the transformation matrix between the coordinate systems C and C', show
that $\mathsf{A}$ represents a rotation of C about the axis OE where E is the point with coordinates
(1, 1, −1) in C.

* Find the rotation angle.


Tensor algebra 

18.4 Write out the transformation formula for a fifth order tensor. [The main difficulty is
finding enough suffix names!]


18.5 In the coordinate system C, a certain second order tensor is represented by the matrix 


T = 



Find the matrix representing the tensor in the coordinate system C', where C' is obtained
(i) by rotating C through an angle of 45° about the axis $Ox_1$,

(ii) by reflecting C in the plane $x_3 = 0$.

18.6 The quantities $t_{ijk}$ and $u_{ijkl}$ are third and fourth order tensors respectively. Decide if
each of the following quantities is a tensor and, if it is, state its order:

3 


0) Ujk^lmnp 

(ii) UjkUmn 

(»“) 

; 1 

3 

(* v > t j'j 

3 

(V) Uii 

7 = 1 

3 

( v i) ^ ^ Ujk Mklmn 

K _ 1 

J= ] 

3 3 

(vii) H u "jj 

i= 1 j= 1 

l = 1 

3 

(viii) Uktmn 

k=\ 

K — 1 

(ix) 

,=1 j= 1 k =1 


18.7 Show that the sum of the squares of the elements of a tensor is an invariant. [First and 
second order tensors will suffice.] 

18.8 If the matrix $\mathsf{T}$ represents a second order tensor, show that $\det \mathsf{T}$ is an invariant. [We
have now found three invariant functions of a second order tensor: the sum of the diagonal
elements, the sum of the squares of all the elements, and the determinant.]

18.9 In crystalline materials, the ordinary elastic moduli are replaced by $c_{ijkl}$, a fourth order
tensor with eighty one elements. It appears that the most general material has eighty one elastic
moduli, but this number is reduced because $c_{ijkl}$ has the following symmetries:

(i) $c_{jikl} = c_{ijkl}$    (ii) $c_{ijlk} = c_{ijkl}$    (iii) $c_{klij} = c_{ijkl}$

How many elastic moduli does the most general material actually have?

Inertia tensor and principal axes 

The following problems do not require moments or products of inertia to be evaluated by integration. 

18.10 Show that $I_{\{O,\mathbf{n}\}}$, the moment of inertia of a body about an axis through O parallel to
the unit vector $\mathbf{n}$, is given by

$$
I_{\{O,\mathbf{n}\}} = \mathsf{n}^T \cdot \mathsf{I}_O \cdot \mathsf{n},
$$

where $\mathsf{I}_O$ is the matrix representing the inertia tensor of the body at O (in some coordinate
system), and $\mathsf{n}$ is the column vector that contains the components of $\mathbf{n}$ (in the same coordinate
system).

Find the moment of inertia of a uniform rectangular plate with sides 2 a and 2b about a 
diagonal. 

18.11 Find the principal moments of inertia of a uniform circular disk of mass M and radius 
a (i) at its centre of mass, and (ii) at a point on the edge of the disk. 

18.12 A uniform circular disk has mass M and radius a. A spinning top is made by fitting
the disk with a light spindle AB which passes through the disk and is fixed along its axis of
symmetry. The distance of the end A from the disk is equal to the disk radius a. Find the
principal moments of inertia of the top at the end A of the spindle.

18.13 A uniform hemisphere has mass M and radius a. A spinning top is made by fitting the 
hemisphere with a light spindle AB which passes through the hemisphere and is fixed along its 
axis of symmetry with the curved surface of the hemisphere facing away from the end A. The 
distance of A from the point where the spindle enters the flat surface is equal to the radius a of 
the hemisphere. Find the principal moments of inertia of the top at the end A of the spindle. 

18.14 Find the principal moments of inertia of a uniform cube of mass M and side 2a (i) at 
its centre of mass, (ii) at the centre of a face, and (iii) at a corner point. 

Find the moment of inertia of the cube (i) about a space diagonal, (ii) about a face diagonal, 
and (iii) about an edge. 

18.15 A uniform rectangular block has mass M and sides 2a , 2b and 2c. Find the principal 
moments of inertia of the block (i) at its centre of mass, (ii) at the centre of a face of area 4 ab. 
Find the moment of inertia of the block (i) about a space diagonal, (ii) about a diagonal of a 
face of area 4 ab. 

18.16 Find the principal moments of inertia of a uniform cylinder of mass M, radius a and 
length 2b at its centre of mass G. Is it possible for the cylinder to have dynamical spherical 
symmetry about G? 

18.17 Determine the dynamical symmetry (if any) of each of the following bodies about their
centres of mass: 

(i) a frisbee, 

(ii) a piece of window glass having the shape of an isosceles triangle, 

(iii) a two-bladed aircraft propeller,

(iv) a three-bladed ship propeller,

(v) an Allen screw (ignore the thread), 

(vi) eight particles of equal mass forming a rigid cubical structure, 

(vii) a cross-handled wheel nut wrench, 

(viii) the great pyramid of Giza, 

(ix) a molecule of carbon tetrachloride. 

18.18* A uniform rectangular plate has mass M and sides 2a and 4a. Find the principal axes
and principal moments of inertia at a corner point of the plate. [Make use of the formula for
$\mathsf{I}_C$ obtained in Example 18.6, with b = 2a.]

If you know how, do this question by finding the eigenvalues and eigenvectors of $\mathsf{I}_C$. If not, try the following
homespun method: Starting from $\mathsf{I}_C$ in the coordinates used in Example 18.6, find $\mathsf{I}'_C$ in the coordinate
system obtained by rotating through an angle $\alpha$ around the axis $Cx_3$. Then choose $\alpha$ to eliminate the
off-diagonal elements.
Problems in rigid body dynamics


KEY FEATURES

The key features of this chapter are the use of the linear and angular momentum principles
to generate the equations of rigid body motion; the importance of body symmetry in simplifying
the problem; and the choice of solution method, vectorial, Lagrangian or Eulerian.


Readers who have reached this point have come a long way and deserve to be congratulated.
With the linear and angular momentum principles and the inertia tensor behind us,
we are now able to solve some of the most interesting and puzzling problems in mechanics.
Rigid body motion is the pinnacle of achievement* in the ‘problem solving’ kind of
mechanics but, given the background we now have, many problems are surprisingly easy.
In this chapter, we classify problems by the dynamical symmetry of the body; first we consider
spherically symmetric bodies, then axially symmetric bodies and finally unsymmetric
bodies. Most problems can be done by either vectorial or analytical methods and
we choose whatever is appropriate in each case.

19.1 EQUATIONS OF RIGID BODY DYNAMICS 


Rigid body in general motion


Translational motion of G:

$$
M \frac{d\mathbf{V}}{dt} = \mathbf{F}
\tag{19.1}
$$

Rotational motion about G:

$$
\frac{d\mathbf{L}_G}{dt} = \mathbf{K}_G
\tag{19.2}
$$

* More advanced topics exist, but they are more concerned with ‘grand general principles’ than problem 
solving! 
FIGURE 19.1 In the motion of the Earth, its angular velocity $\boldsymbol{\omega}$ is preserved
and is not normal to the plane of the orbit. This gives rise to the seasons
(shown here for the northern hemisphere).


easier it is to calculate it. We begin with the simplest case, which is when the body has 
dynamical spherical symmetry. Remember that, in order to have dynamical spherical 
symmetry, the body need not actually be spherical; indeed, one can find bodies that have 
no geometrical symmetry at all that are dynamically spherical. 


19.2 MOTION OF ‘SPHERES’ 

Suppose the body has dynamical spherical symmetry about its centre of mass
G. The body could be a uniform sphere, but it does not have to be. All we require is that
A = B = C at G, in which case the body has the same moment of inertia A about every
axis through G. Equation (19.4) then simplifies to give

$$
\mathbf{L}_G = A\omega_1 \mathbf{e}_1 + A\omega_2 \mathbf{e}_2 + A\omega_3 \mathbf{e}_3
= A\left(\omega_1 \mathbf{e}_1 + \omega_2 \mathbf{e}_2 + \omega_3 \mathbf{e}_3\right) = A\boldsymbol{\omega},
$$

so that $\mathbf{L}_G$ is simply proportional to $\boldsymbol{\omega}$. Equation (19.2) then becomes

$$
A\dot{\boldsymbol{\omega}} = \mathbf{K}_G.
$$

If the body is moving under no forces, or under uniform gravity, then the total moment
$\mathbf{K}_G = \mathbf{0}$ and we obtain

$$
\boldsymbol{\omega} = \boldsymbol{\omega}_0,
$$

a constant. Thus, in either of these cases, the angular velocity of the sphere is preserved
in the motion. Meanwhile, the motion of G, as determined by equation (19.1), is either
straight line or parabolic motion respectively. Hence, when one throws a ball (or a cube!),
G traces out the usual parabola and $\boldsymbol{\omega}$ retains the value that it was given initially. Note
that $\boldsymbol{\omega}$ retains its direction as well as its magnitude, so that even though the direction of $\mathbf{V}$
changes, the rotation axis maintains a constant direction in space.
FIGURE 19.2 The snooker ball is a uniform sphere of mass M
and radius b that moves in contact with a horizontal table.

The most important instance of this is the motion of the Earth in its orbit. If we assume
that the Sun and the Earth are spherically symmetric bodies,* then the total moment of
forces $\mathbf{K}_G$ exerted on the Earth by the Sun is zero and $\boldsymbol{\omega}$ is preserved. Since $\boldsymbol{\omega}$ is not
normal to the plane of the orbit, the motion looks like Figure 19.1 (not to scale!), and this
gives rise to the seasons.

19.3 THE SNOOKER BALL 

Consider a snooker or pool ball, with mass M and radius b, that moves in contact
with a horizontal table (see Figure 19.2). First we will consider general motion of the
ball. In particular, we will allow skidding so that there is no special relation between $\mathbf{V}$,
the velocity of G, and the angular velocity $\boldsymbol{\omega}$. The table is rough† so that the reaction $\mathbf{X}$
exerted by the table on the ball is not necessarily vertical. The governing equations (19.1)
and (19.2) become

$$
\begin{aligned}
M\dot{\mathbf{V}} &= \mathbf{X} - Mg\,\mathbf{k}, \\
\dot{\mathbf{L}}_G &= (-b\,\mathbf{k}) \times \mathbf{X},
\end{aligned}
$$

where $\mathbf{L}_G = A\boldsymbol{\omega}$. On eliminating the reaction $\mathbf{X}$ we find that

$$
\begin{aligned}
A\dot{\boldsymbol{\omega}} &= b\left(M\dot{\mathbf{V}} + Mg\,\mathbf{k}\right) \times \mathbf{k} \\
&= Mb\,\dot{\mathbf{V}} \times \mathbf{k}.
\end{aligned}
$$


* This is a very good approximation but (like everything else in astronomy) it is not exact. The Earth is
slightly spheroidal and the gravitational fields of the Sun and Moon give rise to a small resultant moment
about G. As a consequence, $\boldsymbol{\omega}$ actually precesses very slowly around the normal to the plane of the orbit
(once every 25,730 years).

† But not so rough that the ball is compelled to roll.
On integrating with respect to t, we obtain the non-standard conservation principle*

$$
A\boldsymbol{\omega} + Mb\,\mathbf{k} \times \mathbf{V} = \mathbf{C},
\tag{19.6}
$$

where $\mathbf{C}$ is a constant vector. This is true in any motion of the ball, whether skidding or
rolling.
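The conservation principle (19.6) can be watched at work in a crude simulation of a skidding ball. The sketch below rests on several assumptions that are not in the text: the ball is uniform, so that $A = \tfrac{2}{5}Mb^2$; the friction force has constant magnitude $\mu M g$ and opposes the velocity of the contact particle; the numerical values are invented; and the equations are stepped forward with the simple Euler method. All function names are hypothetical.

```python
import math


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def add(u, v):
    return tuple(ui + vi for ui, vi in zip(u, v))


def scale(c, u):
    return tuple(c * ui for ui in u)


K = (0.0, 0.0, 1.0)  # upward unit vector k


def conserved_vector(V, omega, M, b):
    """Left side of (19.6): A*omega + M*b*(k x V), with A = (2/5) M b^2 assumed."""
    A = 0.4 * M * b * b
    return add(scale(A, omega), scale(M * b, cross(K, V)))


def simulate_skid(V, omega, M=0.16, b=0.026, mu=0.2, g=9.81, dt=1e-4, steps=5000):
    """Euler integration of the skidding ball; returns the final (V, omega)."""
    A = 0.4 * M * b * b
    for _ in range(steps):
        # Velocity of the contact particle: V + omega x (-b k).
        u = add(V, cross(omega, scale(-b, K)))
        speed = math.hypot(u[0], u[1])
        # Assumed friction law: magnitude mu*M*g, opposing the slip velocity.
        friction = scale(-mu * M * g / speed, u) if speed > 1e-9 else (0.0, 0.0, 0.0)
        X = add(friction, (0.0, 0.0, M * g))                 # reaction of the table
        V_dot = add(scale(1.0 / M, X), (0.0, 0.0, -g))       # M dV/dt = X - M g k
        omega_dot = scale(1.0 / A, cross(scale(-b, K), X))   # A d(omega)/dt = (-b k) x X
        V = add(V, scale(dt, V_dot))
        omega = add(omega, scale(dt, omega_dot))
    return V, omega


V0 = (1.0, 0.0, 0.0)
omega0 = (20.0, -10.0, 15.0)
C_start = conserved_vector(V0, omega0, 0.16, 0.026)
V1, omega1 = simulate_skid(V0, omega0)
C_end = conserved_vector(V1, omega1, 0.16, 0.026)
print(C_start)
print(C_end)
```

The velocity and angular velocity both change during the skid, but the combination on the left of (19.6) stays fixed to rounding error. The conservation does not depend on the friction law chosen, since the reaction is eliminated in deriving (19.6).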

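In particular, if we take the scalar product of this equation with $\mathbf{k}$, we obtain the scalar
conservation principle

$$
\boldsymbol{\omega} \cdot \mathbf{k} = n,
\tag{19.7}
$$

where n is a constant. Thus the vertical component of $\boldsymbol{\omega}$ is conserved in any motion of the
ball.

We now examine the special case of rolling. In this case the contact particle C has
zero velocity, so that the rolling condition is

$$
\mathbf{V} + \boldsymbol{\omega} \times (-b\,\mathbf{k}) = \mathbf{0},
$$

that is,

$$
\mathbf{V} + b\,\mathbf{k} \times \boldsymbol{\omega} = \mathbf{0}.
\tag{19.8}
$$

Now, from the conservation principle (19.6), it follows that

$$
\begin{aligned}
A\,\mathbf{k} \times \boldsymbol{\omega} &= \mathbf{k} \times \mathbf{C} - Mb\,\mathbf{k} \times (\mathbf{k} \times \mathbf{V}) \\
&= \mathbf{k} \times \mathbf{C} - Mb\left( (\mathbf{k} \cdot \mathbf{V})\,\mathbf{k} - (\mathbf{k} \cdot \mathbf{k})\,\mathbf{V} \right) \\
&= \mathbf{k} \times \mathbf{C} + Mb\,\mathbf{V},
\end{aligned}
$$

since $\mathbf{k}$ is a unit vector and is perpendicular to $\mathbf{V}$. On substituting this result into the
rolling condition (19.8) we obtain

$$
\left(A + Mb^2\right) \mathbf{V} = -b\,\mathbf{k} \times \mathbf{C},
$$

which shows that $\mathbf{V}$ must be constant in any rolling motion.

The corresponding value of $\boldsymbol{\omega}$ is

$$
\boldsymbol{\omega} = \frac{1}{b}\,\mathbf{k} \times \mathbf{V} + n\,\mathbf{k},
\tag{19.9}
$$

which is also constant. Hence:


* It is actually the angular momentum principle applied about the non-standard point C.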
FIGURE 19.3 The free motion of a body with
dynamic axial symmetry, depicted here as a
square plate. The principal axis $Gx_3$ is the
axis of symmetry and $\mathbf{a}$ (= $\mathbf{a}(t)$) is the axial
unit vector.


The only rolling motions that are possible are straight line motions with constant $\mathbf{V}$
and constant $\boldsymbol{\omega}$; this is despite the presence of the vertical angular velocity component n.

This fact makes the games of pool and snooker possible. In practice, it is difficult to 
strike a ball and not give it some ‘side’ (a non-zero value of n). If rolling took place along 
curved lines when the ball had ‘side’, the game would be impossibly difficult. As it is, 
players often give the ball plenty of ‘side’ deliberately. The ball still rolls in a straight line,
but the ‘side’ affects the bounce of the ball when it hits a cushion; good players use this 
fact to control the ball. As a corollary, it follows that, if a player actually wants to swerve 
a ball, then he must make the ball skid on the table. 


19.4 FREE MOTION OF BODIES WITH AXIAL SYMMETRY 

Bodies with dynamical axial symmetry are the most important objects whose
motion we study. This is for three reasons: (i) They are much more common than spherically
symmetric objects, (ii) they perform more interesting motions than spherically symmetric
bodies, (iii) their motions can often be calculated in closed form.

Consider a rigid body with dynamical axial symmetry. It could be a baseball bat, a
pencil, a top, a spacecraft, a freely-pivoted gyroscope, or the square plate shown in Figure
19.3. The centre of mass G of such a body must lie on the axis of symmetry and we
will suppose that the principal moments of inertia of the body at G are {A, A, C}, with C
being the moment of inertia about the symmetry axis. C may be greater or less than A.
For example, if the body is a short fat cylinder, then C > A, but, if the body is a long
thin cylinder, then C < A.

Consider such a body moving under either (i) no forces, or (ii) uniform gravity.
Then the motion of G is given by particle mechanics and it only remains to calculate the
rotational motion of the body relative to G. Since $\mathbf{K}_G = \mathbf{0}$ in each of these cases, the
equation of rotational motion is $\dot{\mathbf{L}}_G = \mathbf{0}$, where, in the principal axes $Gx_1x_2x_3$,

$$
\mathbf{L}_G = A\omega_1 \mathbf{e}_1 + A\omega_2 \mathbf{e}_2 + C\omega_3 \mathbf{e}_3.
\tag{19.10}
$$
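constant. Hence the axial component of $\boldsymbol{\omega}$ is a constant. We will call this axial component
of $\boldsymbol{\omega}$ the spin of the body.

The equation of motion for the body is therefore


Equation of motion for a free axisymmetric body

$$
A\,\mathbf{a} \times \dot{\mathbf{a}} + Cn\,\mathbf{a} = \mathbf{L}_G
\tag{19.14}
$$


where the spin n (= $\boldsymbol{\omega} \cdot \mathbf{a}$) and the angular momentum $\mathbf{L}_G$ are constants determined by
the initial conditions.

Surprisingly, this equation has a simple exact solution. First, let us write $\mathbf{L}_G = L\,\mathbf{k}$,
where L is the magnitude of $\mathbf{L}_G$ and $\mathbf{k}$ is the unit vector in the same direction as $\mathbf{L}_G$, as
shown in Figure 19.3. Then, on taking the scalar product of equation (19.14) with $\mathbf{a}$, we
obtain

$$
A\,\mathbf{a} \cdot (\mathbf{a} \times \dot{\mathbf{a}}) + Cn\,\mathbf{a} \cdot \mathbf{a} = L\,(\mathbf{a} \cdot \mathbf{k}),
$$

which simplifies to give

$$
Cn = L\,(\mathbf{a} \cdot \mathbf{k}) = L\cos\alpha,
$$

where $\alpha$ is the angle between $\mathbf{a}$ and $\mathbf{k}$.* It follows that $\alpha$ is constant and that n, the constant
axial component of $\boldsymbol{\omega}$, is given by

$$
n = \frac{L\cos\alpha}{C}.
\tag{19.15}
$$

Thus the axis of symmetry of the body makes a constant angle with $\mathbf{k}$ and so sweeps out a
cone around the axis {G, $\mathbf{k}$}.

The progress of the axis of symmetry in time can be found by taking the vector product
of equation (19.14) with $\mathbf{a}$. This gives

$$
A\,\mathbf{a} \times (\mathbf{a} \times \dot{\mathbf{a}}) + Cn\,\mathbf{a} \times \mathbf{a} = L\,(\mathbf{a} \times \mathbf{k}),
$$

that is,

$$
A\left( (\mathbf{a} \cdot \dot{\mathbf{a}})\,\mathbf{a} - (\mathbf{a} \cdot \mathbf{a})\,\dot{\mathbf{a}} \right) + \mathbf{0} = L\,(\mathbf{a} \times \mathbf{k}).
$$

Since $\mathbf{a}$ is a unit vector, $\mathbf{a} \cdot \mathbf{a} = 1$ and $\mathbf{a} \cdot \dot{\mathbf{a}} = 0$ and we obtain

$$
\dot{\mathbf{a}} = \left(\frac{L}{A}\,\mathbf{k}\right) \times \mathbf{a}.
\tag{19.16}
$$

This equation shows that the axis of symmetry of the body precesses around the axis {G, $\mathbf{k}$}
with constant angular speed L/A.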
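The solution can be substituted back into the equation of motion as a check. The sketch below takes $\mathbf{k}$ along the third coordinate axis, lets $\mathbf{a}$ precess about it at the rate L/A on a cone of semi-angle $\alpha$, computes the spin from (19.15), and evaluates the left side of (19.14). The numerical values and function names are illustrative assumptions.

```python
import math


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def axis_and_rate(A, L, alpha, t):
    """Axial unit vector a(t) and its time derivative for precession at rate L/A about k."""
    rate = L / A
    s, c = math.sin(alpha), math.cos(alpha)
    a = (s * math.cos(rate * t), s * math.sin(rate * t), c)
    a_dot = (-rate * s * math.sin(rate * t), rate * s * math.cos(rate * t), 0.0)
    return a, a_dot


def left_side_of_19_14(A, C, L, alpha, t):
    """Evaluate A a x a_dot + C n a with the spin n taken from (19.15)."""
    n = L * math.cos(alpha) / C
    a, a_dot = axis_and_rate(A, L, alpha, t)
    axa = cross(a, a_dot)
    return tuple(A * axa[i] + C * n * a[i] for i in range(3))


# Hypothetical values: A = 2, C = 3, L = 5, alpha = 0.4 rad.
for t in (0.0, 0.3, 1.7):
    print(left_side_of_19_14(2.0, 3.0, 5.0, 0.4, t))   # should be close to (0, 0, L)
```

At every time the result is the fixed vector $L\,\mathbf{k}$, to rounding error, which is what (19.14) requires.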


* We will suppose that n is positive so that $\alpha$ is an acute angle.
FIGURE 19.4 A symmetrical top with its vertex fixed at O. The top moves under uniform
gravity and the reaction force $\mathbf{X}$ of the table.

precesses around the axis {G, $\mathbf{k}$} with angular speed L/A. It is this precession that
is observed as a wobble. The angular momentum magnitude L is not known exactly,
but it differs only slightly from that in the undisturbed state, namely $C\Omega$. Hence
the precession rate is $C\Omega/A$ approximately, and this is the angular frequency of the
wobble. If we regard the frisbee as a circular disk, then C = 2A and the angular
frequency of the wobble is $2\Omega$, approximately twice the spin of the frisbee about its
axis. ■

The above results can also be obtained by using Lagrangian mechanics. The method 
is described in Section 19.6. 

19.5 THE SPINNING TOP 

Consider the symmetrical spinning top* shown in Figure 19.4 (left), which has its
vertex fixed at the origin O. One may imagine that the vertex of the top is lodged in a
small smooth pit in the table from which it cannot escape. The top moves under uniform
gravity and the unknown reaction force $\mathbf{X}$ exerted by the table. Since the top has a particle
fixed at O, we take the equation of motion in the form

$$
\dot{\mathbf{L}}_O = \mathbf{K}_O,
$$

where $\mathbf{K}_O$ is the moment of the external forces about O. In the present case,

$$
\begin{aligned}
\mathbf{K}_O &= \mathbf{0} \times \mathbf{X} + (h\,\mathbf{a}) \times (-Mg\,\mathbf{k}) \\
&= -Mgh\,\mathbf{a} \times \mathbf{k},
\end{aligned}
$$


* A ‘top’ should be understood to mean any rigid body with dynamical axial symmetry that is pivoted at a
point on its symmetry axis, and is moving under uniform gravity.
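where M is the mass of the top and h is the distance OG.

The angular momentum $\mathbf{L}_O$ is calculated in exactly the same way as in the last section.
The angular velocity $\boldsymbol{\omega}$ can be expressed in the form

$$
\boldsymbol{\omega} = \mathbf{a} \times \dot{\mathbf{a}} + \lambda\,\mathbf{a},
$$

and $\mathbf{L}_O$ is then given by

$$
\mathbf{L}_O = A\,\mathbf{a} \times \dot{\mathbf{a}} + C\lambda\,\mathbf{a},
$$

where {A, A, C} are the principal moments of inertia of the top at the vertex O, with C
being the moment of inertia of the top about its axis of symmetry. The equation of motion
for the top is therefore

$$
\frac{d}{dt}\left( A\,\mathbf{a} \times \dot{\mathbf{a}} + C\lambda\,\mathbf{a} \right) = -Mgh\,\mathbf{a} \times \mathbf{k},
$$

that is

$$
A\,\mathbf{a} \times \ddot{\mathbf{a}} + C\left( \dot{\lambda}\,\mathbf{a} + \lambda\,\dot{\mathbf{a}} \right) = -Mgh\,\mathbf{a} \times \mathbf{k}.
$$

If we now take the scalar product of this equation with $\mathbf{a}$, we obtain

$$
A\,\mathbf{a} \cdot (\mathbf{a} \times \ddot{\mathbf{a}}) + C\left( \dot{\lambda}\,(\mathbf{a} \cdot \mathbf{a}) + \lambda\,(\mathbf{a} \cdot \dot{\mathbf{a}}) \right) = -Mgh\,\mathbf{a} \cdot (\mathbf{a} \times \mathbf{k}).
$$

The triple scalar products are both zero since each has two elements the same. Also, since
$\mathbf{a}$ is a unit vector, $\mathbf{a} \cdot \mathbf{a} = 1$ and $\mathbf{a} \cdot \dot{\mathbf{a}} = 0$. It follows that $\dot{\lambda} = 0$, so that $\lambda = n$, a constant.
Hence the axial component of $\boldsymbol{\omega}$ is a constant. We call this axial component of $\boldsymbol{\omega}$ the spin
of the top. The equation of motion for the top can therefore be written


Equation of motion for the top

$$
\frac{d}{dt}\left( A\,\mathbf{a} \times \dot{\mathbf{a}} + Cn\,\mathbf{a} \right) = -Mgh\,\mathbf{a} \times \mathbf{k}
\tag{19.18}
$$


where the spin n (= $\boldsymbol{\omega} \cdot \mathbf{a}$) is a constant determined by the initial conditions.

Steady precession

We will calculate the general motion of the top in the next section by using the Lagrangian
method. Here we will investigate only the special, but important, motion called steady
precession. In this motion, the axial vector $\mathbf{a}$ maintains a constant angle $\alpha$ with the vertical
and precesses round the axis {O, $\mathbf{k}$} with constant angular speed $\Omega$, as shown in Figure
19.4 (right). In this steady precession the rate of change of $\mathbf{a}$ is given by

$$
\dot{\mathbf{a}} = (\Omega\,\mathbf{k}) \times \mathbf{a}.
\tag{19.19}
$$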
We will now investigate whether steady precession of the top can actually occur. From
equation (19.19), it follows that, in steady precession,

$$
\begin{aligned}
\mathbf{a} \times \dot{\mathbf{a}} &= \mathbf{a} \times (\Omega\,\mathbf{k} \times \mathbf{a}) \\
&= \Omega\,(\mathbf{a} \cdot \mathbf{a})\,\mathbf{k} - \Omega\,(\mathbf{a} \cdot \mathbf{k})\,\mathbf{a} \\
&= \Omega\,(\mathbf{k} - \cos\alpha\,\mathbf{a}).
\end{aligned}
$$

Hence

$$
\begin{aligned}
\frac{d}{dt}\left( \mathbf{a} \times \dot{\mathbf{a}} \right) &= \Omega\,(\mathbf{0} - \cos\alpha\,\dot{\mathbf{a}}) \\
&= -\Omega\cos\alpha\,(\Omega\,\mathbf{k}) \times \mathbf{a} \\
&= \Omega^2\cos\alpha\,(\mathbf{a} \times \mathbf{k}).
\end{aligned}
\tag{19.20}
$$

On substituting equations (19.20) and (19.19) into the equation of motion (19.18), we
obtain

$$
\left( A\cos\alpha\,\Omega^2 - Cn\,\Omega + Mgh \right) (\mathbf{a} \times \mathbf{k}) = \mathbf{0}.
$$

This is the condition for steady precession of the top. Since $\mathbf{a} \times \mathbf{k} \neq \mathbf{0}$, steady precession
can only occur if there are values of $\alpha$ and $\Omega$ that satisfy the equation

$$
A\cos\alpha\,\Omega^2 - Cn\,\Omega + Mgh = 0.
\tag{19.21}
$$

This quadratic equation will have real roots for the angular rate $\Omega$ if

$$
C^2 n^2 \geq 4AMgh\cos\alpha.
\tag{19.22}
$$


If the angle $\alpha$ is obtuse, then this condition always holds, even when n = 0. However,
the case in which the top executes a conical type of motion below the pivot O is not
interesting. The interesting case is that in which $\alpha$ is an acute angle and the top precesses
in an ‘upright’ position. Our result is as follows:

The top can undergo steady precession at an acute angle $\alpha$ to the upward vertical
if n (the axial component of $\boldsymbol{\omega}$) is large enough to satisfy the condition (19.22). If this
condition is satisfied there will generally be two different values of the precession rate for
each choice of the angle $\alpha$.


Fast and slow precession 

The two solutions of equation (19.21) for the precession rate $\Omega$ are

$$\Omega^{F,S} = \frac{Cn \pm \left(C^2n^2 - 4AMgh\cos\alpha\right)^{1/2}}{2A\cos\alpha} = \frac{Cn}{2A\cos\alpha}\left[1 \pm \left(1 - \frac{4AMgh\cos\alpha}{C^2n^2}\right)^{1/2}\right]. \tag{19.23}$$
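The two roots in (19.23) are easy to evaluate numerically. The following is a minimal sketch, assuming an acute angle $\alpha$ and positive spin $n$; the function name `steady_precession_rates` and the toy-top values are hypothetical and chosen only for illustration. The slow root is computed from the product of the roots, $Mgh/(A\cos\alpha)$, rather than from the difference $Cn - (\cdots)^{1/2}$, to limit rounding error when the spin is large.

```python
import math

def steady_precession_rates(A, C, n, Mgh, alpha):
    """Return (slow, fast) roots of A*cos(alpha)*W**2 - C*n*W + Mgh = 0.

    Returns None when condition (19.22) fails.
    Assumes 0 < alpha < pi/2 and n > 0.
    """
    a = A * math.cos(alpha)
    disc = (C * n) ** 2 - 4.0 * a * Mgh
    if disc < 0.0:
        return None
    root = math.sqrt(disc)
    fast = (C * n + root) / (2.0 * a)
    # The product of the two roots is Mgh / a; this avoids cancellation.
    slow = Mgh / (a * fast)
    return slow, fast

# Hypothetical toy-top values in SI units, for illustration only.
A, C = 2.0e-4, 1.0e-4          # principal moments of inertia at O
Mgh = 0.1 * 9.81 * 0.03        # mass * g * distance OG
rates = steady_precession_rates(A, C, n=200.0, Mgh=Mgh, alpha=math.pi / 6)
print(rates)
```

When the spin is too small for (19.22) to hold, the function returns `None` instead of a pair of rates.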
FIGURE 19.5 Left: The Euler angles $\theta$, $\phi$ and $\psi$. Right: The corresponding ‘velocity’ diagram.

19.6 LAGRANGIAN DYNAMICS OF THE TOP 
The Euler angles 

In order to solve problems in rigid body dynamics by Lagrangian mechanics, we need a 
set of generalised coordinates for a generally moving rigid body. The position of the 
centre of mass G can be specified by its three Cartesian coordinates, but the position of 
the body relative to G still needs to be specified. The best practical coordinates for this 
purpose are the Euler angles* $\theta$, $\phi$ and $\psi$ shown in Figure 19.5 (left). In order to fix the position of the body, it is sufficient to specify the positions of two more of its particles, $A$ and $B$ say, chosen so that $G$, $A$, $B$ do not lie in a straight line. The position of $A$ is fixed by the ‘polar angles’ $\theta$ and $\phi$, measured relative to the fixed Cartesian coordinate system $Gxyz$. This does not yet fix the position of $B$ since the rigid triangle $GAB$ can still rotate around $GA$. The position of $B$ becomes fixed when we specify the angle $\psi$ through which the triangle $GAB$ has been rotated from some reference position.† Since the angles $\theta$, $\phi$, $\psi$ are clearly independent variables, it follows that the Euler angles $\theta$, $\phi$, $\psi$ are a set of generalised coordinates for the rotational motion of a rigid body.

The Euler angles are a particularly appropriate set of coordinates when the body has axial symmetry. In this case it is usual to take $A$ to be a particle lying on the symmetry axis. Then $\theta$ and $\phi$ determine the orientation of the symmetry axis and $\psi$ is the rotation angle of the body around the symmetry axis.

We may now construct the velocity diagram corresponding to this choice of coordinates. When the coordinates $\theta$, $\phi$, $\psi$ increase individually, each gives the body an angular velocity as shown in Figure 19.5 (right). The angular velocity of the body in general motion is then obtained by adding these three angular velocities vectorially.


* After Leonhard Euler (1707-1783), the greatest Swiss mathematician, and the most prolific of all time.

† The triangle $GAB$ is in its standard reference position when it lies in the plane $AGz$ with $B$ ‘below’ the line $GA$. The configuration shown in Figure 19.5 thus has a $\psi$ value of nearly $2\pi$.

FIGURE 19.6 The angular velocity of the top resolved along three perpendicular principal directions.


Lagrangian for the top 

For a top with its vertex fixed at the origin $O$, we take a fixed system of Cartesian coordinates $Oxyz$ with $Oz$ vertically upwards and define the Euler angles as shown in Figure 19.5 (left) with $G$ replaced by $O$. Since the top has axial symmetry we also take $A$ to lie on the symmetry axis, which is a principal axis of the top at $O$. The top is thus a holonomic system and can be solved by the Lagrangian method. To find the kinetic energy of the top in terms of the Euler angles, the angular velocities in Figure 19.5 need to be resolved along three perpendicular principal axes, as shown in Figure 19.6.* The kinetic energy of the top is therefore given by

$$T = \tfrac{1}{2}A\dot{\theta}^2 + \tfrac{1}{2}A(\dot{\phi}\sin\theta)^2 + \tfrac{1}{2}C(\dot{\psi} + \dot{\phi}\cos\theta)^2, \tag{19.25}$$

where $\{A, A, C\}$ are the principal moments of inertia of the top at $O$. The gravitational potential energy of the top is simply

$$V = MgZ = Mgh\cos\theta,$$

where $h$ is the distance $OG$. Hence the Lagrangian of the top, with the Euler angles as coordinates, is:


Lagrangian for the top

$$L = \tfrac{1}{2}A\dot{\theta}^2 + \tfrac{1}{2}A(\dot{\phi}\sin\theta)^2 + \tfrac{1}{2}C(\dot{\psi} + \dot{\phi}\cos\theta)^2 - Mgh\cos\theta \tag{19.26}$$


* These principal axes are not ‘embedded’ in the top since they do not rotate as $\psi$ increases.

Conserved quantities

Since $\phi$ and $\psi$ appear in the Lagrangian only as $\dot{\phi}$ and $\dot{\psi}$, they are both cyclic coordinates. It follows that the generalised momenta $p_\phi$ and $p_\psi$ are constants of the motion. On evaluating $p_\phi$ ($= \partial L/\partial\dot{\phi}$) and $p_\psi$ ($= \partial L/\partial\dot{\psi}$), these conservation principles become

$$A\dot{\phi}\sin^2\theta + C(\dot{\psi} + \dot{\phi}\cos\theta)\cos\theta = L_z, \tag{19.27}$$

$$C(\dot{\psi} + \dot{\phi}\cos\theta) = Cn, \tag{19.28}$$

where $n$ and $L_z$ are constants, determined by the initial conditions. In fact, $L_z$ is the $z$-component of the angular momentum $\mathbf{L}_O$, and $n$ is the axial component of the angular velocity $\boldsymbol{\omega}$, the quantity known as the spin of the top.* Without losing generality, we will assume that $n$ is positive.

These two conservation principles, together with the energy conservation equation

$$\tfrac{1}{2}A\dot{\theta}^2 + \tfrac{1}{2}A(\dot{\phi}\sin\theta)^2 + \tfrac{1}{2}C(\dot{\psi} + \dot{\phi}\cos\theta)^2 + Mgh\cos\theta = E, \tag{19.29}$$

are sufficient to determine the motion of the top. The energy conservation equation is preferable to the Lagrange equation for the coordinate $\theta$ because it contains only first derivatives of the Euler angles.

The equation for the inclination angle $\theta$

By making use of the conservation equations (19.27) and (19.28) in the energy equation (19.29), we can eliminate $\dot{\phi}$ and $\dot{\psi}$ to give

$$A\dot{\theta}^2 = 2E - Cn^2 - \frac{(L_z - Cn\cos\theta)^2}{A\sin^2\theta} - 2Mgh\cos\theta, \tag{19.30}$$

where the constants $n$, $L_z$, $E$ are determined by the initial conditions. This ODE for the unknown function $\theta(t)$ has the form

$$\dot{\theta}^2 = \frac{F(\cos\theta)}{A^2\sin^2\theta},$$

where the function $F(u)$ is defined by

$$F(u) = A(2E - Cn^2)(1 - u^2) - (L_z - Cnu)^2 - 2AMghu(1 - u^2).$$

This equation is similar to the corresponding equation in the theory of the spherical pendulum (see Chapter 11) and we can make similar deductions about the behaviour of the coordinate $\theta$. Because the left side of (19.30) is positive, it follows that the motion is restricted to those values of $\theta$ that make the function $F(\cos\theta)$ positive. Moreover, the maximum and minimum values of $\theta$ can only occur when $F(\cos\theta) = 0$.

The equation $F(u) = 0$ is a cubic in the variable $u$ and the positions of its roots can be deduced by considering the sign of $F$. We have the following information:


* Note that $n = \dot{\psi} + \dot{\phi}\cos\theta$, not $\dot{\psi}$.


Chapter 19 Problems in rigid body dynamics 


FIGURE 19.7 The form of the cubic function 
F(u). The black squares are at its roots. 

(i) As u tends to positive (or negative) infinity, F(u) tends to positive (or negative) infinity. 

(ii) $F(1) = -(L_z - Cn)^2$ and $F(-1) = -(L_z + Cn)^2$, which are both negative.†

(iii) Given that $\theta_0$, the initial value of $\theta$, lies in the range $0 < \theta_0 < \pi$ and that $\dot{\theta}$ is not zero initially, then $F(\cos\theta_0)$ must be positive.

The graph of $F(u)$ must therefore have the form shown in Figure 19.7. It follows that the equation $F(u) = 0$ must have a root $u_{\min}$ lying in $-1 < u < \cos\theta_0$, a root $u_{\max}$ lying in $\cos\theta_0 < u < 1$, and a third root lying in $u > 1$, as indicated by the black squares in Figure 19.7; this accounts for all three roots. The root greater than unity is not physically admissible since it lies outside the range of $u$ when $u = \cos\theta$. It follows that, in the motion, $u$ must oscillate in the range $u_{\min} \leq u \leq u_{\max}$. Hence, as in the case of the spherical pendulum, the inclination angle $\theta$ must perform periodic oscillations between two extreme values $\alpha$ and $\beta$. The difference here is that it is possible for both $\alpha$ and $\beta$ to be acute angles (measured from the upward vertical); in other words, the top can stand ‘upright’. This oscillation of the inclination angle $\theta$ is called nutation of the top.

Example 19.3 Finding the range of the inclination angle 

Suppose that the top is released with its axis at rest and making an angle of $\pi/3$ with the upward vertical. Find the function $F(u)$. For the case in which $C^2n^2 = 4AMgh$, find the range of values of $\theta$ that occur in the subsequent motion.

Solution

On using the initial conditions $\theta = \pi/3$, $\dot{\theta} = 0$, $\dot{\phi} = 0$ when $t = 0$, we find that $L_z = \tfrac{1}{2}Cn$ and $E = \tfrac{1}{2}Cn^2 + \tfrac{1}{2}Mgh$. The function $F(u)$ therefore reduces to

$$F(u) = \tfrac{1}{4}(1 - 2u)\left[4AMgh(1 - u^2) - C^2n^2(1 - 2u)\right],$$

and when $C^2n^2 = 4AMgh$, this becomes

$$F(u) = AMgh\,u(1 - 2u)(2 - u).$$
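The reduction of $F(u)$ in this example can be checked numerically, and the admissible range of $u = \cos\theta$ read off from the roots. The sketch below is illustrative only: it assumes units in which $A = Mgh = 1$ (so that $Cn = 2$ gives $C^2n^2 = 4AMgh$), and the helper names `F` and `bisect` are hypothetical. Since $F$ is positive only for $0 < u < \tfrac{1}{2}$ among the values $-1 \leq u \leq 1$, the computed roots suggest that $\theta$ ranges between $\pi/3$ and $\pi/2$.

```python
import math

def F(u, A, C, n, Mgh, Lz, E):
    """The cubic F(u) defined in the text, with u = cos(theta)."""
    return (A * (2.0 * E - C * n * n) * (1.0 - u * u)
            - (Lz - C * n * u) ** 2
            - 2.0 * A * Mgh * u * (1.0 - u * u))

def bisect(f, lo, hi, tol=1e-12):
    """Root of f in [lo, hi], assuming f changes sign across the bracket."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if f_mid == 0.0:
            return mid
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Assumed units: A = Mgh = 1, and C*n = 2 so that C^2 n^2 = 4 A Mgh.
A, Mgh, C, n = 1.0, 1.0, 1.0, 2.0
Lz = 0.5 * C * n                    # from theta = pi/3 and phi-dot = 0
E = 0.5 * C * n * n + 0.5 * Mgh     # from theta-dot = 0 as well
f = lambda u: F(u, A, C, n, Mgh, Lz, E)

u_low = bisect(f, -0.25, 0.25)      # root near u = 0
u_high = bisect(f, 0.25, 0.75)      # root near u = 1/2
theta_range = (math.acos(u_high), math.acos(u_low))
print(theta_range)
```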



† $F(\pm 1)$ is zero when $L_z = \pm Cn$ respectively. This corresponds to a special (and not very important) class of motions in which the axis of the top passes through the upward or downward vertical. We exclude these cases from consideration.

Suppose that $C^2n^2 > 4AMgh$ so that steady precession can exist at any inclination. Then if the initial precession rate $\Omega$ is set equal to $\Omega^S$ (the rate of slow steady precession at inclination $\alpha$), the resulting motion of the top is (unsurprisingly) slow steady precession. However, if $\Omega < \Omega^S$, it can be shown (by differentiating equation (19.30) with respect to $t$) that the initial value of $\ddot{\theta}$ is positive so that $\theta$ initially increases and $\alpha$ is the minimum value taken by $\theta$. (In other words, the axis of the top falls initially.) In this case, we can see from equation (19.32) that

$$A\Omega\sin^2\alpha \leq A\dot{\phi}\sin^2\theta \leq A\Omega\sin^2\alpha + Cn(\cos\alpha - \cos\beta),$$

where $\beta$ is the maximum value taken by $\theta$. It follows that

(i) If $\Omega > 0$, then the precession rate $\dot{\phi}$ is always positive. This is the case shown in Figure 19.8 (left). The critical case in which $\Omega = 0$ (when the axis is released from rest) is shown in Figure 19.8 (centre).

(ii) There is no reason why the top axis cannot be projected the ‘wrong’ way. If $\Omega < 0$ (but is not so negative that $A\Omega\sin^2\alpha + Cn(\cos\alpha - \cos\beta) < 0$), then $\dot{\phi}$ is sometimes positive and sometimes negative! In this case, the path of the axis crosses itself, as shown in Figure 19.8 (right).

(iii) If the value of $\Omega$ is so negative that $A\Omega\sin^2\alpha + Cn(\cos\alpha - \cos\beta) < 0$, then $\dot{\phi}$ is always negative. However, this requires a ‘fast’ precession rate that is unlikely to be observed.

In a similar way, one can show that when $\Omega^S < \Omega < \Omega^F$, the axis of the top rises initially and that $\dot{\phi}$ is always positive. The motion of the axis resembles that shown in Figure 19.8 (left).

Example 19.4 The precession during a period of $\theta$

In the last example, find the angle through which the top precesses during one period of the inclination angle $\theta$.

Solution

In the last example, the axis of the top is released from rest with $\theta = \pi/3$, and with $C^2n^2 = 4AMgh$. With $\alpha = \pi/3$ and $\Omega = 0$, the equation for the precession rate $\dot{\phi}$ becomes

$$\dot{\phi} = \frac{Cn(1 - 2\cos\theta)}{2A\sin^2\theta}.$$

It follows that $\Phi$, the increase in $\phi$ during one period $\tau$ of the coordinate $\theta$, is given by

$$\Phi = \phi(\tau) - \phi(0) = \int_0^{\tau}\dot{\phi}\,dt = \frac{Cn}{2A}\int_0^{\tau}\frac{1 - 2\cos\theta}{\sin^2\theta}\,dt = 2\left(\frac{Mgh}{A}\right)^{1/2}\int_{\pi/3}^{\pi/2}\left(\frac{1 - 2\cos\theta}{\sin^2\theta}\right)\frac{d\theta}{\dot{\theta}}$$
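The integral for $\Phi$ can be evaluated numerically. The following is a minimal sketch, not the text's own method: it assumes $\dot{\theta}$ is taken from $\dot{\theta}^2 = F(\cos\theta)/(A^2\sin^2\theta)$ with $F(u) = AMgh\,u(1-2u)(2-u)$ from Example 19.3, so that all physical constants cancel. The substitution $\cos\theta = u = \tfrac{1}{2}\sin^2 t$ is an assumption introduced here to remove the integrable endpoint singularities; it turns the integral into $\Phi = 2\sqrt{2}\int_0^{\pi/2}\cos^2 t\,\big/\,\big[(1-u^2)(2-u)^{1/2}\big]\,dt$. The function names are hypothetical.

```python
import math

def integrand(t):
    """Integrand for Phi after the substitution cos(theta) = u = sin(t)**2 / 2."""
    u = 0.5 * math.sin(t) ** 2
    return 2.0 * math.sqrt(2.0) * math.cos(t) ** 2 / ((1.0 - u * u) * math.sqrt(2.0 - u))

def simpson(f, a, b, n=200):
    """Composite Simpson rule with n (even) subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0

Phi = simpson(integrand, 0.0, math.pi / 2)
print(Phi, math.degrees(Phi))   # precession angle in radians and in degrees
```

Because the transformed integrand is smooth on the whole interval, a modest number of Simpson panels is enough; doubling `n` is a quick convergence check.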
The angular velocity of the gyro relative to the frame $Gx_1x_2x_3$ is still given by equation (19.33) but now the frame $Gx_1x_2x_3$ is not inertial but rotates with the Earth’s angular velocity $\boldsymbol{\Omega}^E$. This can be written in the form

$$\boldsymbol{\Omega}^E = \Omega^E\sin\alpha\,\mathbf{e}_1 + \Omega^E\cos\alpha\,\mathbf{e}_3 = \Omega^E\sin\alpha\cos\phi\,\mathbf{e}'_1 - \Omega^E\sin\alpha\sin\phi\,\mathbf{e}'_2 + \Omega^E\cos\alpha\,\mathbf{e}'_3 \tag{19.36}$$

where the angle $\alpha$ is the co-latitude of the location where the gyro is situated. Hence, by the addition theorem for angular velocities, the true angular velocity of the gyro (relative to an inertial frame) is

$$\boldsymbol{\omega} = (\dot{\psi}\,\mathbf{e}'_1 + \dot{\phi}\,\mathbf{e}'_3) + \boldsymbol{\Omega}^E = \left(\dot{\psi} + \Omega^E\sin\alpha\cos\phi\right)\mathbf{e}'_1 - \left(\Omega^E\sin\alpha\sin\phi\right)\mathbf{e}'_2 + \left(\dot{\phi} + \Omega^E\cos\alpha\right)\mathbf{e}'_3.$$

It follows that the true angular momentum of the gyro is given by

$$\mathbf{L}_G = C\left(\dot{\psi} + \Omega^E\sin\alpha\cos\phi\right)\mathbf{e}'_1 - A\left(\Omega^E\sin\alpha\sin\phi\right)\mathbf{e}'_2 + A\left(\dot{\phi} + \Omega^E\cos\alpha\right)\mathbf{e}'_3. \tag{19.37}$$

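For the same reasons as before, the applied moment has the form $\mathbf{K}_G = K\mathbf{e}'_2$, where $K$ is the unknown moment of constraint. The equation of motion of the gyro is therefore

$$\frac{d\mathbf{L}_G}{dt} = K\mathbf{e}'_2 \tag{19.38}$$

where $\mathbf{L}_G$ is given by equation (19.37). Now

$$\frac{d\mathbf{L}_G}{dt} = C\frac{d}{dt}\left(\dot{\psi} + \Omega^E\sin\alpha\cos\phi\right)\mathbf{e}'_1 + C\left(\dot{\psi} + \Omega^E\sin\alpha\cos\phi\right)\frac{d\mathbf{e}'_1}{dt} - A\frac{d}{dt}\left(\Omega^E\sin\alpha\sin\phi\right)\mathbf{e}'_2 - A\left(\Omega^E\sin\alpha\sin\phi\right)\frac{d\mathbf{e}'_2}{dt} + A\ddot{\phi}\,\mathbf{e}'_3 + A\left(\dot{\phi} + \Omega^E\cos\alpha\right)\frac{d\mathbf{e}'_3}{dt}.$$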


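The vectors $\{\mathbf{e}'_1, \mathbf{e}'_2, \mathbf{e}'_3\}$ are the unit vectors of the frame $Gx'_1x'_2x'_3$ and so their (true) rates of change are given by

$$\frac{d\mathbf{e}'_1}{dt} = \boldsymbol{\omega}^*\times\mathbf{e}'_1, \qquad \frac{d\mathbf{e}'_2}{dt} = \boldsymbol{\omega}^*\times\mathbf{e}'_2, \qquad \frac{d\mathbf{e}'_3}{dt} = \boldsymbol{\omega}^*\times\mathbf{e}'_3,$$

where $\boldsymbol{\omega}^*$ is the angular velocity of the frame $Gx'_1x'_2x'_3$ relative to an inertial frame. By the addition theorem for angular velocities, this is given by

$$\boldsymbol{\omega}^* = \dot{\phi}\,\mathbf{e}'_3 + \boldsymbol{\Omega}^E = \left(\Omega^E\sin\alpha\cos\phi\right)\mathbf{e}'_1 - \left(\Omega^E\sin\alpha\sin\phi\right)\mathbf{e}'_2 + \left(\dot{\phi} + \Omega^E\cos\alpha\right)\mathbf{e}'_3.$$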

On combining these results and simplifying, we obtain the three component equations of motion for the gyro. As before, the second of these equations serves to determine $K$ and the first and third are

$$\frac{d}{dt}\left(\dot{\psi} + \Omega^E\sin\alpha\cos\phi\right) = 0,$$

and

$$A\ddot{\phi} + \Omega^E\sin\alpha\left[C\left(\dot{\psi} + \Omega^E\sin\alpha\cos\phi\right) - A\Omega^E\sin\alpha\cos\phi\right]\sin\phi = 0.$$

It follows that

$$\dot{\psi} + \Omega^E\sin\alpha\cos\phi = n,$$

where $n$ is a constant. Hence the spin $\boldsymbol{\omega}\cdot\mathbf{e}'_1$ of the gyro is a constant of the motion. The equation for the precession angle $\phi$ then becomes

$$A\ddot{\phi} + \Omega^E\sin\alpha\left[Cn - A\Omega^E\sin\alpha\cos\phi\right]\sin\phi = 0. \tag{19.39}$$

In practice, the ratio $A\Omega^E/Cn$ is very small so that $Cn - A\Omega^E\sin\alpha\cos\phi$ can be closely approximated by $Cn$.

Equation (19.39) is the equation for the (large) oscillations of a pendulum about the direction $\phi = 0$, that is, about the axis $Gx_1$. Since this axis points north, it follows that the axis of the gyro performs periodic oscillations about the northerly direction. If these oscillations are damped (a feature not included in our model), then the axis of the gyro will eventually settle pointing north. The gyro does therefore ‘home’ to north and (despite statements in the literature to the contrary) does not have to be initially set pointing north.


Example 19.5 The oscillation period of the gyrocompass

Estimate the period of small oscillations of the gyro in a typical case.

Solution

In a typical case the ratio $\Omega^E/n < 10^{-6}$ so that the equation (19.39) is very accurately approximated by

$$A\ddot{\phi} + \left(Cn\Omega^E\sin\alpha\right)\sin\phi = 0,$$

which, for small oscillations, is approximated by

$$A\ddot{\phi} + \left(Cn\Omega^E\sin\alpha\right)\phi = 0.$$

The period of these simple harmonic oscillations is

$$2\pi\left(\frac{A}{Cn\Omega^E\sin\alpha}\right)^{1/2}.$$

With $\Omega^E = 0.000073$ rad/s, $n = 50$ Hz, $C = 2A$ and $\alpha = 45°$ this gives a period of about 35 seconds. ■
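The arithmetic behind this estimate can be reproduced in a few lines. This is a minimal sketch: the function name `gyro_period` is hypothetical, and it assumes that the spin of 50 Hz is to be converted to $2\pi \times 50$ rad/s before use. With the values quoted in the example it returns a period close to the 35 seconds stated above.

```python
import math

def gyro_period(C_over_A, n, omega_earth, alpha):
    """Small-oscillation period 2*pi*sqrt(A / (C n Omega_E sin(alpha))).

    C_over_A    -- ratio C/A of the principal moments of inertia
    n           -- spin in rad/s
    omega_earth -- Earth's angular speed in rad/s
    alpha       -- co-latitude in radians
    """
    return 2.0 * math.pi / math.sqrt(C_over_A * n * omega_earth * math.sin(alpha))

# Values from the example; 50 Hz is assumed to mean 2*pi*50 rad/s.
tau = gyro_period(2.0, 2.0 * math.pi * 50.0, 7.3e-5, math.radians(45.0))
print(tau)   # period in seconds
```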


19.8 EULER’S EQUATIONS 

The simplest set of equations for the rotational motion of a rigid body are Euler’s 
equations. They apply to any body, symmetrical or not, but they come with some deficiencies.

FIGURE 19.11 Relative to the embedded frame, $\boldsymbol{\omega}$ and $\mathbf{L}_G$ precess around the symmetry axis.


where $P$ and $\gamma$ are constants of integration; the corresponding solution for $\omega_2$ is

$$\omega_2 = P\cos(\Omega t + \gamma).$$

The general solution of the Euler equations in this case is therefore

$$\omega_1 = P\sin(\Omega t + \gamma), \qquad \omega_2 = P\cos(\Omega t + \gamma), \qquad \omega_3 = n,$$

where $P$ and $\gamma$ are constants. The corresponding components of the angular momentum vector $\mathbf{L}_G$ are

$$L_1 = AP\sin(\Omega t + \gamma), \qquad L_2 = AP\cos(\Omega t + \gamma), \qquad L_3 = Cn.$$

The geometrical interpretation of these results is that the vectors $\mathbf{e}_3$, $\boldsymbol{\omega}$ and $\mathbf{L}_G$ all lie in the same plane, where they make constant angles with each other, and that this plane rotates around the embedded axis $Gx_3$ at the constant rate $(A - C)n/A$. This situation is shown in Figure 19.11 for the case in which $A > C$.

Motion of the body 

What we have found is the time variation of $\boldsymbol{\omega}$ and $\mathbf{L}_G$ as seen from the principal embedded frame $Gx_1x_2x_3$. The true picture (relative to an inertial frame) is quite different. In this frame, $\mathbf{L}_G$ is constant and the time variation of $\boldsymbol{\omega}$ and the motion of the body have yet to be determined. In general, deducing the motion of the body is more difficult than solving the Euler equations. In the present case though, this can be done fairly easily. From the above expressions for $\boldsymbol{\omega}$ and $\mathbf{L}_G$, it follows that $\boldsymbol{\omega}$ can be written in the form

FIGURE 19.12 The axially symmetric body is constrained to rotate about the fixed axis $PQ$ which makes an angle $\beta$ with the axis of symmetry.

velocity components in the principal embedded frame $Gx_1x_2x_3$ are

$$\omega_1 = -\lambda\sin\beta, \qquad \omega_2 = 0, \qquad \omega_3 = \lambda\cos\beta.$$

To find the moment exerted by the pivots, we simply substitute these expressions into the left sides of Euler’s equations. This gives

$$K_1 = 0, \qquad K_2 = (A - C)\lambda^2\sin\beta\cos\beta, \qquad K_3 = 0.$$

Thus the applied moment is constant in the embedded frame. However, it is not constant in an inertial frame. The axis $Gx_2$ is perpendicular to $PQ$ and rotates around it at a constant rate $\lambda$. The pivots are therefore subjected to oscillating forces with angular frequency $\lambda$ and magnitude proportional to $\lambda^2$. In such a configuration, the body is said to be dynamically unbalanced. Since $G$ lies on the rotation axis, the body can rest in equilibrium in any position, but, when rotating, it exerts oscillating forces on the pivots that may result in vibration of the mounting. This is why the wheels of motor cars must be re-balanced* when a new tyre is fitted. For a turbine with a mass of many tonnes rotating at 50 Hz say, extremely accurate alignment is needed, for otherwise the forces needed to restrain the turbine would be so large that it would break loose from its mountings!

19.9 FREE MOTION OF AN UNSYMMETRICAL BODY 

The free motion of unsymmetrical bodies is an important topic in space technology because of its application to the tumbling motion of spacecraft and space satellites. Such motions are considerably more difficult to analyse than for bodies with axial symmetry. The reader may be surprised then to learn that there is an exact analytical solution to the problem of an unsymmetrical body moving under no forces (see Landau & Lifshitz [6], Chapter VI). However, since this solution is expressed in terms of an impressive array of elliptic functions and theta functions, it gives one almost no idea of what the motion is actually like! Instead, we will use Euler’s equations combined with geometrical arguments. This enables us to make interesting predictions about the motion without too much analysis.

* This is done by adding lead weights at determined positions around the rim of the wheel.

FIGURE 19.13 The unsymmetrical body moves under no forces with $G$ at rest.

Rotation of a free body about a fixed axis 

Suppose a free unsymmetrical body $B$ (an asteroid, for example) is observed from an inertial frame in which its centre of mass $G$ is at rest. Is it possible for the body to rotate about a fixed* axis?

This is one of the few questions about unsymmetrical bodies that can be answered quite easily. If such an axis does exist, then it must pass through the fixed point $G$. Let $Gx_1x_2x_3$ be a set of principal axes of $B$ at $G$. Since $B$ is unsymmetrical, these principal axes are unique (apart from labelling) and are therefore necessarily embedded in $B$; Euler’s equations therefore apply.

The kinetic energy of $B$ is given by $T = \tfrac{1}{2}I\,|\boldsymbol{\omega}|^2$, where $I$ is the moment of inertia of $B$ about the rotation axis and $\boldsymbol{\omega}$ is its angular velocity. Since $B$ is rotating about a fixed axis, $I$ is a constant. Also, since the motion takes place under no forces, the kinetic energy $T$ is constant. It follows that the angular speed $|\boldsymbol{\omega}|$ must be constant and that the components $\{\omega_1, \omega_2, \omega_3\}$ of $\boldsymbol{\omega}$ in the embedded principal frame $Gx_1x_2x_3$ must be constants. Hence $\dot{\omega}_1 = \dot{\omega}_2 = \dot{\omega}_3 = 0$ and Euler’s equations then imply that


$$(B - C)\,\omega_2\omega_3 = (C - A)\,\omega_3\omega_1 = (A - B)\,\omega_1\omega_2 = 0,$$


* This means an axis fixed in space and fixed in the body.

FIGURE 19.14 The $L$-sphere and the curves in which it meets different $T$-ellipsoids for the case $A < B < C$.

which integrates to give

$$L_1^2 + L_2^2 + L_3^2 = L^2, \tag{19.45}$$

where $L$ is a constant. Similarly, if we multiply the first equation by $L_1$, the second by $L_2$, the third by $L_3$ and add, we obtain

$$BC\,L_1\dot{L}_1 + AC\,L_2\dot{L}_2 + AB\,L_3\dot{L}_3 = 0,$$

which integrates to give

$$\frac{L_1^2}{2A} + \frac{L_2^2}{2B} + \frac{L_3^2}{2C} = T, \tag{19.46}$$


where $T$ is a constant. The first of these integrals means that the magnitude of $\mathbf{L}_G$ is conserved* and the second means that the kinetic energy of the body is conserved.

The equations (19.45), (19.46) place geometric restrictions on the time variation of $\mathbf{L}_G$, as seen from the embedded frame. Equation (19.45) means that, if $\mathbf{L}_G$ is considered to be the position vector of a ‘point’ $L$ in $(L_1, L_2, L_3)$-space, then that point must move on the surface of a sphere with centre $O$ and radius $L$, which we will call the $L$-sphere. In the same way, equation (19.46) means that the $L$-point must also move on the surface of the ellipsoid† in $(L_1, L_2, L_3)$-space defined by equation (19.46). We will call this ellipsoid, which has the semi-axes $(2AT)^{1/2}$, $(2BT)^{1/2}$, $(2CT)^{1/2}$, the $T$-ellipsoid. It follows that the path of the $L$-point must be a curve of intersection of the $L$-sphere and the $T$-ellipsoid. Figure 19.14 shows the $L$-sphere for a fixed value of the parameter $L$. The paths on its surface are examples of the curves in which it is cut by the $T$-ellipsoid for different values of the parameter $T$.

* Note that, in the embedded frame, the individual components $L_1$, $L_2$, $L_3$ are not conserved, but the magnitude $(L_1^2 + L_2^2 + L_3^2)^{1/2}$ is.

† An ellipsoid is the three-dimensional counterpart of an ellipse. The standard equation of an ellipsoid with semi-axes $a$, $b$, $c$ is $x^2/a^2 + y^2/b^2 + z^2/c^2 = 1$.

Paths of the L-point 

Suppose that $A < B < C$ and that $L$ is fixed. For each value of $L$, values of $T$ that are too small or too large are not admissible. Indeed we can see from equation (19.46) that

$$T_{\min} = \frac{L^2}{2C} \leq T \leq \frac{L^2}{2A} = T_{\max}.$$

If $T$ lies outside this range, then the $L$-sphere and the $T$-ellipsoid do not intersect.

(i) When $T = T_{\min}$ the intersection consists of the two isolated points $(0, 0, \pm L)$. The constant solution $\mathbf{L} = (0, 0, L)$ corresponds to steady rotation in the positive sense about the principal axis $Gx_3$. The solution $\mathbf{L} = (0, 0, -L)$ corresponds to a steady rotation in the opposite sense.

(ii) Suppose now that $T$ is slightly increased. The intersection point $(0, 0, L)$ then becomes a small closed curve* on the surface of the $L$-sphere that encircles the point $(0, 0, L)$ as shown in Figure 19.14. The $L$-point must therefore move along this curve. The direction of motion of the $L$-point can be deduced by examining the signs of $\dot{L}_1$ and $\dot{L}_2$ given by the first and second equations of (19.44). As $T$ increases, the intersection curve becomes larger (and changes its shape) but it still encircles the point $(0, 0, L)$. This pattern continues until $T = L^2/2B$.

(iii) When $T = L^2/2B$, the $L$-sphere and the $T$-ellipsoid touch at the points $(0, \pm L, 0)$ and the intersection curves are the dashed lines in Figure 19.14. This is a transitional case between two different regimes of curves. It includes the case of steady rotation about the principal axis $Gx_2$.

(iv) When $T > L^2/2B$, the intersection curves are completely different. Now they encircle the point $(L, 0, 0)$ (or $(-L, 0, 0)$) and, as $T$ approaches $T_{\max}$, they shrink to zero around these points. The case $T = T_{\max}$ corresponds to steady rotation about the principal axis $Gx_1$.
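The four cases above amount to comparing $T$ with the three values $L^2/2C$, $L^2/2B$ and $L^2/2A$. A minimal sketch of this bookkeeping follows; the function name `classify_path`, the returned labels and the relative tolerance used to detect the boundary cases are all assumptions made for illustration.

```python
def classify_path(A, B, C, L, T, rel_tol=1e-12):
    """Classify the path of the L-point for principal moments A < B < C."""
    if not (A < B < C):
        raise ValueError("expected A < B < C")
    t_min, t_mid, t_max = L * L / (2 * C), L * L / (2 * B), L * L / (2 * A)

    def close(x, y):
        return abs(x - y) <= rel_tol * max(abs(x), abs(y))

    if close(T, t_min):
        return "steady rotation about Gx3"
    if close(T, t_max):
        return "steady rotation about Gx1"
    if T < t_min or T > t_max:
        return "not admissible"
    if close(T, t_mid):
        return "transitional case"
    if T < t_mid:
        return "closed curve around (0, 0, +-L)"
    return "closed curve around (+-L, 0, 0)"

# Hypothetical values: A, B, C = 1, 2, 3 and L = 6 give thresholds 6, 9, 18.
for T in (5.0, 6.0, 7.0, 9.0, 12.0, 18.0, 20.0):
    print(T, classify_path(1.0, 2.0, 3.0, 6.0, T))
```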


* The projection of this curve on to the $(x_1, x_2)$-plane is a small ellipse with centre the origin.

Stability of steady rotation about a principal axis

We can make some interesting deductions from the form of the above paths.

(i) In the general motion, the $L$-point moves around a closed curve. It follows that the time variation of $\mathbf{L}$, as seen from the embedded frame, is periodic, a property not apparent from equations (19.44). This does not however mean that the motion of the body is periodic.

(ii) If the body is in steady rotation in the positive sense about the axis $Gx_1$ and is slightly disturbed, then the path of $L$ remains close to the point $(L, 0, 0)$. This implies that, in an inertial frame where $\mathbf{L}$ is constant, the axis $Gx_1$ remains close to $\mathbf{L}$. In other words, the original steady rotation about the axis $Gx_1$ is stable to small disturbances. The same applies to a steady rotation about the axis $Gx_3$. However, if a steady rotation about the axis $Gx_2$ is slightly disturbed, the paths of the $L$-point lead far away from the original constant positions at $(0, \pm L, 0)$. This implies that a steady rotation about the axis $Gx_2$ is unstable to small disturbances. Hence:

The steady rotation of an unsymmetrical body about a principal axis is stable for the 
axes with the greatest and least moments of inertia, but unstable for the other axis. 

Thus, it is useless to try to stabilise a satellite in steady rotation about the principal axis 
with the ‘middle’ moment of inertia. 

(iii) The stability argument above is modified if the body has a means of energy dissipation. This cannot happen with rigid bodies, but real bodies, such as a satellite, can flex and slowly dissipate mechanical energy as heat. In this case, steady rotation of the body about the axis with the least moment of inertia (which corresponds to $T = T_{\max}$) is no longer stable. The ‘radius’ of the path of $L$ around the axis $Gx_1$ will slowly increase until $T = L^2/(2B)$ when it will switch to a large path encircling one of the points $(0, 0, \pm L)$. This path will then gradually shrink to zero. Thus, in the presence of energy dissipation, the body will end up rotating about the axis with the largest moment of inertia. In the early days of space exploration* this fact was learned by hard experience!

Motion of the body 

Suppose that Euler’s equations have been solved for the unknowns $\omega_1$, $\omega_2$, $\omega_3$. What is the corresponding motion of the body? Let $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ be the unit vectors of the principal embedded frame $Gx_1x_2x_3$. Then the time variation of these vectors, as seen from an inertial frame, is given by

$$\dot{\mathbf{e}}_1 = \boldsymbol{\omega}\times\mathbf{e}_1, \qquad \dot{\mathbf{e}}_2 = \boldsymbol{\omega}\times\mathbf{e}_2, \qquad \dot{\mathbf{e}}_3 = \boldsymbol{\omega}\times\mathbf{e}_3.$$

Unfortunately, these equations are not really uncoupled since Euler’s equations give us $\boldsymbol{\omega}$ in the form

$$\boldsymbol{\omega} = \omega_1\mathbf{e}_1 + \omega_2\mathbf{e}_2 + \omega_3\mathbf{e}_3,$$

which also involves the unknown vectors $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$. On combining these equations, we get the set of coupled ODEs

$$\dot{\mathbf{e}}_1 = \omega_3\mathbf{e}_2 - \omega_2\mathbf{e}_3, \qquad \dot{\mathbf{e}}_2 = \omega_1\mathbf{e}_3 - \omega_3\mathbf{e}_1, \qquad \dot{\mathbf{e}}_3 = \omega_2\mathbf{e}_1 - \omega_1\mathbf{e}_2, \tag{19.47}$$

for the unknown vectors $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$. These equations cannot usually be solved explicitly, but they are suitable for numerical integration (see Problem 19.17) and for approximate solution by perturbation analysis. This is illustrated by the following example.

* Kaplan [5] recounts the following incident: The first U.S. satellite, Explorer I, was set into orbit rotating about its longitudinal axis, which was the principal axis with the least moment of inertia. After only a few hours, radio signals indicated that a tumbling motion had developed and was increasing in amplitude in an unstable manner. It was concluded that the satellite’s flexible antennae were dissipating energy and causing a transfer of body spin from the axis of minimum inertia to a transverse axis of maximum inertia.
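As an illustration of the numerical route, the sketch below integrates the free Euler equations, taken here in the form $A\dot{\omega}_1 = (B - C)\omega_2\omega_3$ and its cyclic counterparts, together with the coupled equations (19.47), using a classical fourth-order Runge-Kutta step. It is not the solution of Problem 19.17: the moments of inertia, initial conditions, step size and function names are all hypothetical. As a consistency check it monitors the inertial-frame vector $A\omega_1\mathbf{e}_1 + B\omega_2\mathbf{e}_2 + C\omega_3\mathbf{e}_3$, which should stay constant in free motion.

```python
def derivative(y, A, B, C):
    """Right-hand sides of the free Euler equations and of equations (19.47).

    y = [w1, w2, w3, e1 (3 components), e2 (3 components), e3 (3 components)].
    """
    w1, w2, w3 = y[0], y[1], y[2]
    e1, e2, e3 = y[3:6], y[6:9], y[9:12]
    dw = [(B - C) * w2 * w3 / A, (C - A) * w3 * w1 / B, (A - B) * w1 * w2 / C]
    de1 = [w3 * e2[i] - w2 * e3[i] for i in range(3)]
    de2 = [w1 * e3[i] - w3 * e1[i] for i in range(3)]
    de3 = [w2 * e1[i] - w1 * e2[i] for i in range(3)]
    return dw + de1 + de2 + de3

def rk4_step(y, dt, A, B, C):
    """One classical fourth-order Runge-Kutta step."""
    shift = lambda k, s: [yi + s * ki for yi, ki in zip(y, k)]
    k1 = derivative(y, A, B, C)
    k2 = derivative(shift(k1, dt / 2), A, B, C)
    k3 = derivative(shift(k2, dt / 2), A, B, C)
    k4 = derivative(shift(k3, dt), A, B, C)
    return [yi + dt / 6 * (a + 2 * b + 2 * c + d)
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]

def inertial_L(y, A, B, C):
    """Angular momentum A w1 e1 + B w2 e2 + C w3 e3 in inertial components."""
    return [A * y[0] * y[3 + i] + B * y[1] * y[6 + i] + C * y[2] * y[9 + i]
            for i in range(3)]

# Hypothetical body and initial state: spin mostly about Gx3, slightly disturbed.
A, B, C = 1.0, 2.0, 3.0
y = [0.1, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0]
L0 = inertial_L(y, A, B, C)
drift = 0.0
for _ in range(5000):
    y = rk4_step(y, 0.002, A, B, C)
    L_now = inertial_L(y, A, B, C)
    drift = max(drift, max(abs(a - b) for a, b in zip(L_now, L0)))
print(drift)   # largest change seen in any inertial component of L
```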

Example 19.7 Motion of a principal axis in space 

An unsymmetrical body is in steady rotation about the principal axis with the largest 
moment of inertia. Find an approximation to the wobble of this principal axis if the 
body is slightly disturbed. 

Solution 

Suppose that $A < B < C$ and that, in the initial motion, $\omega_1 = \omega_2 = 0$ and $\omega_3 = \Lambda$, where $\Lambda$ is a constant. In the disturbed motion we suppose that $\omega_1$, $\omega_2$ (and their time derivatives) remain small. If we now apply Euler’s equations and then linearise, we obtain

$$A\dot{\omega}_1 - (B - C)\Lambda\omega_2 = 0, \qquad B\dot{\omega}_2 - (C - A)\Lambda\omega_1 = 0, \qquad C\dot{\omega}_3 = 0.$$

Hence, in the linear approximation to the disturbed motion, the coupled equations for $\omega_1$, $\omega_2$ are easily solved to give

$$\omega_1 = \epsilon\Lambda\left(\frac{C - B}{A}\right)^{1/2}\cos(\Omega t + \gamma), \qquad \omega_2 = \epsilon\Lambda\left(\frac{C - A}{B}\right)^{1/2}\sin(\Omega t + \gamma), \tag{19.48}$$

where $\epsilon$, $\gamma$ are dimensionless constants and

$$\Omega = \Lambda\left(\frac{(C - A)(C - B)}{AB}\right)^{1/2}. \tag{19.49}$$

The corresponding expressions for $L_1$, $L_2$ are

$$L_1 = \epsilon\Lambda\left(A(C - B)\right)^{1/2}\cos(\Omega t + \gamma), \qquad L_2 = \epsilon\Lambda\left(B(C - A)\right)^{1/2}\sin(\Omega t + \gamma).$$

Hence the projection of the path of the $L$-point on to the $(x_1, x_2)$-plane is an ellipse executed in the anti-clockwise sense. This is entirely consistent with the paths shown in Figure 19.14 and our discussion of stability.
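The linearised solution can be checked by substituting it back into the linearised Euler equations. The sketch below does this numerically; the function name `wobble`, the sample moments of inertia and the values of $\epsilon$ and $\gamma$ are hypothetical, and the time derivatives are written out analytically from the cosine and sine forms above.

```python
import math

def wobble(A, B, C, Lam, eps, gamma, t):
    """Linearised wobble about the axis of greatest moment (A < B < C).

    Returns (Omega, w1, w2, dw1, dw2) at time t.
    """
    Omega = Lam * math.sqrt((C - A) * (C - B) / (A * B))
    amp1 = eps * Lam * math.sqrt((C - B) / A)
    amp2 = eps * Lam * math.sqrt((C - A) / B)
    phase = Omega * t + gamma
    w1, w2 = amp1 * math.cos(phase), amp2 * math.sin(phase)
    dw1, dw2 = -Omega * amp1 * math.sin(phase), Omega * amp2 * math.cos(phase)
    return Omega, w1, w2, dw1, dw2

# Hypothetical values for illustration.
A, B, C, Lam = 1.0, 2.0, 3.0, 1.0
Omega, w1, w2, dw1, dw2 = wobble(A, B, C, Lam, eps=0.01, gamma=0.3, t=1.7)
residual_1 = A * dw1 - (B - C) * Lam * w2
residual_2 = B * dw2 - (C - A) * Lam * w1
print(Omega, residual_1, residual_2)
```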

Now to find the actual time variation of the unit vector $\mathbf{e}_3$. In the initial motion we have

$$\mathbf{e}_1 = \cos\Lambda t\,\mathbf{i} + \sin\Lambda t\,\mathbf{j}, \qquad \mathbf{e}_2 = -\sin\Lambda t\,\mathbf{i} + \cos\Lambda t\,\mathbf{j}, \qquad \mathbf{e}_3 = \mathbf{k},$$

where $\{\mathbf{i}, \mathbf{j}, \mathbf{k}\}$ is a fixed orthonormal set. Since $\omega_1$, $\omega_2$ are small in the disturbed motion, the vectors $\mathbf{e}_1$ and $\mathbf{e}_2$ that appear in the third equation of (19.47) can be

on the $(x, y)$-plane. The thick dashed curve is the path of the point of contact $C$. The vector $\mathbf{a}$ ($= \mathbf{a}(t)$) is the axial unit vector and the vector $\mathbf{n}$ ($= \mathbf{n}(t)$) is the unit vector in the direction $CG$. Since the disk has axisymmetry, the angular velocity of the disk can be expressed in the form

$$\boldsymbol{\omega} = \mathbf{a}\times\dot{\mathbf{a}} + \lambda\,\mathbf{a}, \tag{19.50}$$

where the spin $\lambda$ ($= \boldsymbol{\omega}\cdot\mathbf{a}$) is an unknown scalar function of the time. The corresponding angular momentum $\mathbf{L}_G$ is then given by

$$\mathbf{L}_G = A\,\mathbf{a}\times\dot{\mathbf{a}} + C\lambda\,\mathbf{a}, \tag{19.51}$$

where $\{A, A, C\}$ are the principal moments of inertia of the disk at $G$.

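The disk moves under the forces shown in Figure 19.15 (right). It follows that the equation of translational motion of the disk is

$$M\frac{d\mathbf{V}}{dt} = \mathbf{X} - Mg\,\mathbf{k}, \tag{19.52}$$

where $\mathbf{V}$ is the velocity of $G$, and that the equation of rotational motion is

$$\frac{d}{dt}\left(A\,\mathbf{a}\times\dot{\mathbf{a}} + C\lambda\,\mathbf{a}\right) = (-b\mathbf{n})\times\mathbf{X}. \tag{19.53}$$

In addition, we have the rolling condition that the particle of the disk in contact with the floor has zero velocity, that is

$$\mathbf{V} + \boldsymbol{\omega}\times(-b\mathbf{n}) = \mathbf{0}. \tag{19.54}$$

Equations (19.52)-(19.54) are the governing equations of the motion. On eliminating $\mathbf{X}$ and $\mathbf{V}$, we obtain the equation

$$\frac{d}{dt}\left(A\,\mathbf{a}\times\dot{\mathbf{a}} + C\lambda\,\mathbf{a}\right) + Mb^2\left[\dot{\boldsymbol{\omega}} - (\dot{\boldsymbol{\omega}}\cdot\mathbf{n})\,\mathbf{n} - (\boldsymbol{\omega}\cdot\mathbf{n})\,\dot{\mathbf{n}}\right] + Mgb\,\mathbf{n}\times\mathbf{k} = \mathbf{0}, \tag{19.55}$$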

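where $\boldsymbol{\omega}$ is given by equation (19.50). The vectors $\mathbf{a}$ and $\mathbf{n}$ are not independent. In fact $\mathbf{n}$ can be expressed in terms of $\mathbf{a}$ and $\mathbf{k}$ in the form

$$\mathbf{n} = \frac{\mathbf{k} - (\mathbf{k}\cdot\mathbf{a})\,\mathbf{a}}{\left|\mathbf{k} - (\mathbf{k}\cdot\mathbf{a})\,\mathbf{a}\right|}. \tag{19.56}$$

Thus it is possible in principle to eliminate $\mathbf{n}$ from equation (19.55) and obtain a vector equation solely in terms of the unknown axial vector $\mathbf{a}$ and the spin $\lambda$. This equation is sufficient to determine $\mathbf{a}$ and $\lambda$.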

All this begins to look easier when we take components of the vector equation (19.55) 
in three well chosen directions, namely, the (instantaneous) principal directions a, n and 
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Hence, in this linear approximation, A. = A, a constant. The second equation gives 

A<() + C A 0 = constant, 

and, in order to be satisfied by the unperturbed state, the constant must be zero. On 
substituting this into the third equation, we obtain 

\[
A(A + Mb^2)\,\ddot{\theta} + \left[ \Lambda^2 C(C + Mb^2) - AMgb \right]\theta = 0. \tag{19.61}
\]

If the expression in the square brackets is positive, this is the SHM equation and $\theta$
will perform small oscillations about $\theta = 0$; the same is then true of $\dot{\phi}$. Hence, the
condition for stability of straight line rolling is

\[
\Lambda^2 C(C + Mb^2) - AMgb > 0,
\]

that is

\[
V^2 > \frac{AMgb^3}{C(C + Mb^2)},
\]

where $V$ ($= b\Lambda$) is the speed of the disk. For a hoop with $A = \tfrac{1}{2}Mb^2$ and $C = Mb^2$, this
condition becomes $V^2 > gb/4$. ■
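The stability threshold is easy to evaluate numerically. The sketch below takes the condition in the form V² > AMgb³ / (C(C + Mb²)) and computes the critical speed for a hoop and, as an extra case not treated in the text, for a uniform disk with the standard moments A = Mb²/4 and C = Mb²/2. The function name, the mass and the radius are illustrative assumptions.

```python
import math


def critical_speed(A, C, M, b, g=9.81):
    """Speed V at which V^2 = A M g b^3 / (C (C + M b^2)); rolling is stable above it."""
    return math.sqrt(A * M * g * b**3 / (C * (C + M * b**2)))


M, b = 1.0, 0.35  # hypothetical mass (kg) and radius (m)

hoop = critical_speed(A=0.5 * M * b**2, C=M * b**2, M=M, b=b)
disk = critical_speed(A=0.25 * M * b**2, C=0.5 * M * b**2, M=M, b=b)
print(hoop, disk)
```

For the hoop this reproduces V² = gb/4, which is a little under 1 m/s for the radius chosen here; for the uniform disk the same formula gives V² = gb/3. The mass cancels in both cases, so only the radius and the mass distribution matter.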

Path of the point of contact 

If r is the position vector of C, the point of contact, then

\[
\mathbf{r} = \mathbf{R} - b\,\mathbf{n},
\]

where R is the position vector of G. It follows that

\[
\frac{d\mathbf{r}}{dt} = \mathbf{V} - b\,\dot{\mathbf{n}},
\]


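where V is the velocity of G. On using the rolling condition (19.54) and the expression
(19.50) for $\boldsymbol{\omega}$, the differential equation for r becomes

\[
\frac{d\mathbf{r}}{dt} = b\left[ (\mathbf{a}\times\dot{\mathbf{a}})\times\mathbf{n} + \lambda\,\mathbf{a}\times\mathbf{n} - \dot{\mathbf{n}} \right]. \tag{19.62}
\]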


This is the path equation for the point of contact. Once the equations (19.60) have been
solved for $\lambda$, $\theta$ and $\phi$, the right side of (19.62) is known and the path of C can then be
found by integration. Numerical integration shows that some possible paths can be quite 
exotic, as shown in Figure 19.16. 


FIGURE 19.16 Two not-so-typical paths for the point of contact of a rolling wheel.


Stability of the bicycle

Our analysis of the rolling wheel gives some idea of the complexity of the mechanics of
the bicycle. Indeed, the question of stability is not completely settled and rival sets of
equations appear in the literature. Probably the most important paper on bicycle physics is
that by Jones.* Jones constructed a non-standard bicycle in which the angular momentum
of the wheels (the ‘gyroscopic effect’) was cancelled out by fitting extra wheels that spun
in the opposite direction. It turned out that this bicycle was just as stable as a normal
one and could be ridden ‘no hands’. Despite Jones’s findings, the myth that a bicycle is
stabilised by the gyroscopic effect of the spinning wheels still persists. However, the
consensus of informed opinion is that the stability of a bicycle depends crucially on the
geometry of the forks holding the front wheel. Jones constructed another non-standard
bicycle in which the front wheel was four inches ahead of its normal position; it was
almost unrideable! The website http://ruina.tam.cornell.edu/research/index.htm
has references to further interesting work on bicycle physics.


* Jones, D.E.H., The stability of the bicycle. Physics Today 23 (1970) pp. 34-40.


Problems on Chapter 19

Answers and comments are at the end of the book.

Harder problems carry a star (*).

Rolling balls

19.1 Ball rolling on a slope A uniform ball can roll or skid on a rough plane inclined at an
angle $\beta$ to the horizontal. Show that, in any motion of the ball, the component of $\boldsymbol{\omega}$ perpendicular
to the plane is conserved. If the ball rolls on the plane, show that the path of the ball must
be a parabola.

19.2 Ball rolling on a rotating turntable * A rough horizontal turntable is made to rotate
about a fixed vertical axis through its centre O with constant angular velocity $\Omega\mathbf{k}$, where the
unit vector k points vertically upwards. A uniform ball of radius a can roll or skid on the
turntable. Show that, in any motion of the ball, the vertical spin $\boldsymbol{\omega}\cdot\mathbf{k}$ is conserved. If the ball
rolls on the turntable, show that

\[
\dot{\mathbf{V}} = \tfrac{2}{7}\,\Omega\,\mathbf{k}\times\mathbf{V},
\]

where V is the velocity of the centre of the ball viewed from a fixed reference frame. Deduce
the amazing result that the path of the rolling ball must be a circle.

Suppose the ball is held at rest (relative to the turntable), with its centre a distance b from
the axis {O, k}, and is then released. Given that the ball rolls, find the radius and the centre of
the circular path on which it moves.

19.3 Ball rolling on a fixed sphere * A uniform ball with radius a and centre C rolls on the
rough outer surface of a fixed sphere of radius b and centre O. Show that the radial spin $\boldsymbol{\omega}\cdot\mathbf{c}$
is conserved, where c (= c(t)) is the unit vector in the radial direction OC. [Take care!] Show
also that c satisfies the equation

\[
7(a + b)\,\mathbf{c}\times\ddot{\mathbf{c}} + 2an\,\dot{\mathbf{c}} + 5g\,\mathbf{c}\times\mathbf{k} = \mathbf{0},
\]

where n is the constant value of $\boldsymbol{\omega}\cdot\mathbf{c}$ and k is the unit vector pointing vertically upwards.

By comparing this equation with that for the spinning top, deduce the amazing result that
the ball can roll on the spherical surface without ever falling off. Find the minimum value of
n such that the ball is stable at the highest point of the sphere.

Axisymmetric bodies 

19.4 Investigate the steady precession of a top for the case in which the axis of the top moves
in the horizontal plane through O. Show that for any $n \neq 0$ there is just one rate of steady
precession and find its value.

19.5 The sleeping top By performing a perturbation analysis, show that a top will be stable 
in the vertically upright position if 

\[
C^2 n^2 > 4AMgh,
\]

in the standard notation. [You can do this by either the vectorial or Lagrangian method.] 

19.6 Estimate how large the spin n of a pencil would have to be for it to be stable in the
vertically upright position, spinning on its point. [Take the pencil to be a uniform cylinder 15
cm long and 7 mm in diameter.]

19.7 A juggler is balancing a spinning ball of diameter 20 cm on the end of his finger. Estimate
the spin required for stability (i) for a uniform solid ball, (ii) for a uniform thin hollow
ball. Which do you suppose the juggler uses?

19.8 Solve the problem of the free motion of an axisymmetric body by the Lagrangian
method. Compare your results with those in Section 19.4. [Surprisingly awkward. Take
the axis Gz of coordinates Gxyz to point in the direction of $\mathbf{L}_G$ and make use of the Lagrange
equations for the Euler angles $\phi$ and $\psi$.]

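19.9 Frisbee with resistance A (wobbling) frisbee moving through air is subject to a frictional
couple equal to $-K\boldsymbol{\omega}$. Find the time variation of the axial spin $\lambda$ (= $\boldsymbol{\omega}\cdot\mathbf{a}$), where a is the
axial unit vector. Show also that a satisfies the equation

\[
A\,\mathbf{a}\times\ddot{\mathbf{a}} + K\,\mathbf{a}\times\dot{\mathbf{a}} + C\lambda\,\dot{\mathbf{a}} = \mathbf{0}.
\]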
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subject many students find difficult. The author’s clear and systematic style promotes a 
good understanding of the subject: each concept is motivated and illustrated by worked 
examples, while problem sets provide plenty of practice for understanding and technique. 
Computer assisted problems, some suitable for projects, are also included. The book is 
structured to make learning the subject easy; there is a natural progression from core 
topics to more advanced ones and hard topics are treated with particular care. A theme 
of the book is the importance of conservation principles. These appear first in vectorial 
mechanics where they are proved and applied to problem solving. They reappear in analytical
mechanics, where they are shown to be related to symmetries of the Lagrangian,
culminating in Noether’s theorem. 
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• Profusely illustrated and thoroughly class-tested, with a clear direct style that makes the 
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Bloody instructions, which, being taught,
Return to plague th’inventor.

SHAKESPEARE, Macbeth, act I, sc. 7


Front Cover The photograph on the front cover shows Mimas, 
one of the many moons of Saturn; the huge crater was formed 
by an impact. Mimas takes 22 hours 37 minutes to orbit Saturn,
the radius of its orbit being 185,500 kilometers. After reading
Chapter 7, you will be able to estimate the mass of Saturn from 
this data! 
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Preface 


Information for readers 


What is this book about and who is it for? 

This is a book on classical mechanics for university undergraduates. It aims to cover 
all the material normally taught in classical mechanics courses from Newton’s laws to 
Hamilton’s equations. If you are attending such a course, you will be unlucky not to 
find the course material in this book. 

What prerequisites are needed to read this book? 

It is expected that the reader will have attended an elementary calculus course and an 
elementary course on differential equations (ODEs). A previous course in mechanics 
is helpful but not essential. This book is self-contained in the sense that it starts from 
the beginning and assumes no prior knowledge of mechanics. However, in a general 
text such as this, the early material is presented at a brisker pace than in books that are 
specifically aimed at the beginner. 

What is the style of the book? 

The book is written in a crisp, no-nonsense style; in short, there is no waffle! The object
is to get the reader to the important points as quickly and easily as possible, consistent 
with good understanding. 

Are there plenty of examples with full solutions? 

Yes there are. Every new concept and technique is reinforced by fully worked examples.
The author’s advice is that the reader should think how he or she would do each
worked example before reading the solution; much more will be learned this way! 

Are there plenty of problems with answers? 

Yes there are. At the end of each chapter there is a large collection of problems. For 
convenience, these are arranged by topic and trickier problems are marked with a star. 
Answers are provided to all of the problems. A feature of the book is the inclusion 
of computer assisted problems. These are interesting physical problems that cannot be 
solved analytically, but can be solved easily with computer assistance. 

Where can I find more information? 

More information about this book can be found on the book’s homepage 

http://www.cambridge.org/Gregory 

All feedback from readers is welcomed. Please e-mail your comments, corrections and 
good ideas by clicking on the comments button on the book’s homepage. 
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